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ASSAYS FOR THE DIRECT MEASUREMENT OF GENE DOSAGE 

FIELD OF THE INVENTION 

The present invention relates to compositions and methods for the detection and 
5 quantification of aneuploidy, and of variations in gene dosage. In particular, the present 
invention relates to compositions, methods, and kits for quantifying variations in gene 
dosage in a homogeneous reaction without the need for target amplification, fragment size 
resolution, or microscopy. Still more particularly, the present invention relates to 
compositions, methods and kits for using invasive cleavage structure assays {e.g., the 
10 INVADER assay) to screen nucleic acid samples, e.g., from patients, for the presence of 
variations in gene copy number, e.g., of individual genes or of chromosomes or portions of 
chromosomes. The present invention also relates to compositions, methods and kits for 
gene dosage in a single reaction container. 

1 5 BACKGROUND OF THE INVENTION 

Variations in gene dosage are clinically significant indicators of disease states. Such 
variations arise due to errors in DNA replication and can occur in germ line cells, leading to 
congential defects and even embryonic demise, or in somatic cells, often resulting in cancer. 
These replication anomalies can cause deletion or duplication of parts of genes, full-length 
20 genes and their surrounding regulatory regions, megabase-long portions of chromosomes, or 
entire chromosomes. 

Single-gene copy number abnormalities often play a role in cancer biology, typically 
by altering the level of expression of a key gene product, such as a tumor suppressor, 
transcription factor, or membrane receptor. Such increased or decreased expression can in 

25 turn affect cancer development, progression, and response to treatment. For example, 

amplification of the her 2/neu gene, which occurs in 20-30% of breast cancer cases and can 
range in magnitude from single copy to more than 20 copies per chromosome (described in 
US Patent No. 4,968,603), accelerates cancer progression and relapse, decreases survival 
time, and alters response to therapeutic treatments (Konigshoff, M. et al., Clinical 

30 Chemistry 49:2, 219-229 (2003)). 

Chromosomal abnormalities affect gene dosage on a larger scale and can affect 
either the number or structure of chromosomes. Conditions wherein cells, tissues, or 
individuals have one or more whole chromosomes or segments of chromosomes either 
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absent, or in addition to the normal euploid complement of chromosomes can be referred to 
as aneuploidy. Germline replication errors due to chromosome non-disjunction result in 
either monosomies (one copy of an autosomal chromosome instead of the usual two or only 
one sex chromosome) or trisomies (three copies). Such events, when they do not result in 
5 outright embryonic demise, typically lead to a broad array of disorders often recognized as 
syndromes, e.g., trisomy 21 and Down's syndrome, trisomy 18 and Edward's syndrome, 
and trisomy 13 and Patau's syndrome. Structural chromosome abnormalities affecting parts 
of chromosomes arise due to chromosome breakage, and result in deletions, inversions, 
translocations or duplications of large blocks of genetic material. These events are often as 
10 devastating as the gain or loss of the entire chromosome and can lead to such disorders as 
Prader-Willi syndrome (del 1 5q 1 1 - 1 3), retinoblastoma (del 1 3ql 4), Cri du chat syndrome 
(del 5p), and others listed in US Patent No. 5,888,740, herein incorporated in its entirety by 
reference. 

When chromosomal abnormalities arise in somatic cells, for example as the result of 

15 acquired mutations such as loss of heterozygosity (LOH) or gene duplication, they are often 
associated with cancer. For example, loss of all or part of chromosome 9 is associated with 
progression of bladder cancer (Tsukamoto, M. et al. Cancer Genetics and Cytogenetics 134, 
41-45 (2002)). Chromosome abnormalities often accumulate throughout tumor development 
and are associated with progressively worse prognoses, for example, amplification of a 

20 region of chromosome 20 can be used as a prognostic indicator of breast cancer (described 
in US Patent No. 6,268,184). 

A number of methods have been developed to detect variations in gene and 
chromosome copy number. Applications for such methods include prenatal screening, 
preimplantation genetic diagnosis (PGD), cancer screening, and tumor analysis. The first 

25 developed and still most widely used methods, generally classified as "cytogenic" methods 
involve microscopic visualization of chromosomes. The pioneering cytogenetic method is a 
technique for staining condensed chromosomes, termed "karyotyping," and first described 
in the early 1960's (reviewed in McNeil, N. and Ried, T., Expert Reviews in Molecular 
Medicine 14: 1-14, September (2000)). The stained chromosomes are analyzed for overall 

30 shape, total number, variations of chromosomal regions and for anomalies (Seabright, M. 
Lancet: 1, 967 (1972), Caspersson, T., et al Exp. Cell Res. 60: 315-319 (1970)). 
Karyotyping remains the gold standard and is often still the method of choice in cytogenetic 
laboratories. While such analysis can provide definitive evidence of trisomy, monosomy 
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and some large-scale structural abnormalities such as loss of most or all of a chromosome 
arm, classic karyotyping nonetheless suffers from numerous limitations, particularly when 
applied in a clinical or diagnostic setting. Primary among these is limited resolution. The 
smallest changes typically discernable using classic karyotyping methods are on the order of 
5 10 MB, i.e., roughly the width of a Giemsa-stained band. Furthermore, this type of analysis 
is not generally informative with regard to chromosomal translocations (Tyessier, J.R. 
Cancer Genet. Cytogenet. 37:103 (1989). In addition, traditional stain-based karyotyping is 
limited to certain applications relates to the types of samples suitable for analysis. 
Typically, large numbers of living, dividing cells, e.g., in culture, are required; the approach 

10 is thus not suitable for archived samples. Finally, conventional karyotype analysis is time 
consuming, labor intensive, and requires a high degree of skill. 

Molecular cytogenetic techniques are distinguished from classic karyotyping by 
their reliance on nucleic acid hybridization, in lieu of pure chemical staining, to visualize 
select chromosomal regions. Among the first such methods developed was Comparative 

1 5 Genomic Hybridization, or CGH, (described in US Patent No. 6,1 59,685 and related 

applications, herein incorporated in their entirety by reference). In CGH, genomic DNA is 
isolated from one or more test samples {e.g., tumor cells, embryos) and from a reference 
sample {e.g., a healthy cell). Each DNA preparation is labeled with a distinguishable label, 
such as fluorescent dyes having different absorption/emission spectra. By comparing 

20 different ratios across or within given chromosomes, this method can be used to compare 
copy numbers of different sequences within a single sample or between samples. A key 
advantage of CGH relative to classic karyotyping is its suitability for using archived, 
formalin-fixed paraffin-embedded specimens (Struski, S. et al., Cancer Genetics and 
Cytogenetics 135: 63-90 (2002)). However, as with conventional karyotyping, while 

25 accurate for analyzing chromosomal copy number abnormalities, CGH has limited 

resolution of deletions and amplifications, on the order of 3-20 Mb (Struski, S., supra and 
Lichter, P. J. Mol. Diagn. 2: 171-173 (2000)). 

An alternative molecular cytogenetic method with enhanced resolution relative to 
CGH is fluorescence in situ hybridization, or FISH, in which nucleic acid probes, often 

30 several kb in length, are labeled with fluorophores and hybridized to isolated chromosomes, 
(described in US Patent Nos. 5,663,319, 6,300,066 and related applicatons and reviewed in 
McNeil and Ried, supra and Tepperberg, J. et al. Prenat. Diagn. 21: 293-301 (2001)). In 
some cases, efforts are made to select probes that are specific for indivdual chromosomes. 
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The resulting hybrids are viewed through a microscope and analyzed for the extent and 
location of fluorescent signal. A related method is chromosome painting, described in US 
Patent Nos. 6,255,465, 6,270,971 and related patents, herein incorporated by reference. 
This method involves hybridizing to a genomic DNA sample a multiplicity of different 
5 labeled chromosome-specific probes, prepared by isolating chromosomes, usually by flow 
cytometry. Individual chromosomes are then visualized by fluorescence microscopy. 

Another method involves comparing sample chromosomal DNA to a reference 
based on the presence or absence of restriction fragment length polymorphisms, either by 
restriction endonuclease digestion or PCR amplification, followed in each case by 

1 0 hybridization to labeled probes comprising the polymorphic site, as described in US Patent 
Nos 5,380,645, 5,580,729 and related applications). 

Efforts to develop still more rapid and higher throughput methods for analyzing 
aberrations in chromosome copy number and gene dosage have led to the development of 
PCR-based approaches. Various quantitative PCR strategies have been applied to the 

15 determination of copy number, including real time fluorescence PCR (e.g., that described in 
US Patent No. 6,180,349), quantitative fluorescence PCR (QF-PCR) {e.g., Bili, C. et al. 
Prenat. Diagn. 22: 360-365 (2002)), quantitative PCR (Q-PCR) using internal controls 
selected to match the target sequence in length and GC content (e.g., US Patent No. 
5,888,740). QF-PCR methods have been developed for detecting aneuploidy of 

20 chromosomes 13, 18, 21, X and Y using highly polymorphic, chromosome-specific short 
tandem repeats (STRs) as chromosome-specific markers (described in Findlay, I. et al. J. 
Assist. Reprod. Genet. 15: 266-75 (1998); Cirigliano, V. et al., Ann. Hum. Genet. 65: 421- 
427 (2001) and Cirigliano, V. et al. Prenat. Diagn. 19: 1099-1 103 (1999)). Such methods 
have also been used to test for selected translocations (Adinolfi, M. and Sherlock, J., Lancet 

25 358: 1030-1 (2001)) and for deletions, duplications and gene dosage (Ruiz-Ponte, C. et al. 
Clinical Chem. 46: 1574-1582 (2000); Poropat, R.A. and Nicholson, G. A., Clinical Chem. 
44: 724-730 (1988); and Konigshoff, M. et al., supra.) 

Other PCR-based methods target non-repetitive, gene-based sequences. Rahil et al. 
described a QF-PCR method directed to various genie regions on chromosomes . 13, 18, and 

30 21 Rahil, H. et al., European J Hum Gen 10:462-466 (2002). An alternative approach, 
termed Multiplex Ligation-dependent Probe Amplification (MLPA) involves ligation of 
chromosome-specific probes comprising distinct "tails", which are subsequently PCR- 
amplified to yield fragments of specific length indicative of the presence of the target being 
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detected. These amplified fragments are then separated by size to indicate which of the 
probed chromosomal regions are present, absent, or duplicated (Schouten, J.P., et al. 9 
Nucleic Acids Res. 30: e 57 (2002)). In general, such PCR-based methods have the 
advantage of being applicable to a variety of biological sample types, including blood, 
5 cultured amniocytes, amniotic fluid, urine, etc. They are also more amenable to high 

throughput analysis and execution by machines or technicians than are cytogenetic methods 
requiring microscopic analysis. Results obtained using such methods are often available in 
a matter of hours or days. However, it is typically necessary to analyze multiple loci per 
chromosome with such approaches, since any homozygosity or preferential amplification of 

1 0 only one allele of a locus may occur, affecting interpretation of the results (Adinolfi, M. and 
Sherlock, J., ibid). Moreover, because of the dangers of false positive reactions, these PCR- 
based procedures require rigid controls to prevent contamination and carry over (Ehrlich et 
aL, in PCR-Based Diagnostics in Infectious Diseases, Ehrlich and Greenberg (eds), 
Blackwell Scientific Publications, [1994], pp.3- 18). 

15 Therefore, there exists a need for a rapid and quantitative detection assay for 

measuring aneuploidy and gene dosage directly, without the need without the need for 
target amplification, fragment size resolution, or microscopy. 

SUMMARY OF THE INVENTION 

20 The present invention provides compositions and methods for the detection and 

characterization of mutations resulting in alterations in gene dosage. More particularly, the 
present invention provides compositions, methods and kits for using invasive cleavage 
structure assays (e.g., the INVADER assay) to screen nucleic acid samples, e.g., from 
patients, for the presence of variations resulting in changes in gene copy number. The 

25 present invention also provides compositions, methods and kits for screening patient 
samples in a single reaction container. 

In some embodiments, the present invention provides a method for selecting a 
chromosome-specific oligonucleotide sequence, comprising identifying a chromosome- 
specific genie sequence that is unique in a genome, identifying an exon tag sequence within 

30 the genie sequence, wherein the exon tag sequence is compared to the genome to determine 
that the exon tag sequence is unique within the genome, and selecting an oligonucleotide 
sequence complementary to the exon tag or its complement. In some preferred 
embodiments, the exon tag sequence within the genie sequence is less than 100 base pairs in 
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length. In some particularly preferred embodiments, the exon tag sequence within the genie 
sequence is 91 base pairs in length. In other preferred embodiments, the exon tag sequence 
is the length of an entire exon. 

In some embodiments, the selection of an oligonucleotide sequence complementary 
5 to the exon tag or its complement comprises selecting an oligonucleotide sequence having 
20% to 70%, and preferably 40-60% GC content. 

Some embodiments of the present invention provide a method for detecting 
aneuploidy of a chromosome in a subject, comprising the steps of: a) selecting an exon tag 
sequence for the chromosome; b) providing a non-amplifying oligonucleotide detection 
10 assay configured to detect the exon tag sequence or its complement; and c) detecting the 
exon tag with the non-amplifying oligonucleotide detection assay. 

In some embodiments, the selecting of an exon tag sequence comprises the steps of: 
a) identifying a genie sequence that is specific to the chromosome in the subject, and that is 
unique in the genome of the species of the subject; and b) identifying an exon tag sequence 
15 within the genie sequence, wherein the exon tag sequence is compared to the genome to 
determine that the exon tag sequence is unique within the genome of the species of the 
subject. 

In some embodiments, the method of the present invention further comprises 
providing an internal control and a non-amplifying oligonucleotide detection assay 

20 configured to detect the internal control, wherein the internal control target is detected using 
the non-amplifying oligonucleotide detection assay configured to detect the internal control. 

In some embodiments the present invention provides a method for detecting 
aneuploidy of a chromosome in a subject, comprising the steps of: a.selecting an exon tag 
sequence for the chromosome; b) providing a non-amplified oligonucleotide detection assay 

25 configured to detect the exon tag sequence or its complement; and c) detecting the exon tag 
with the non-amplified oligonucleotide detection assay. 

In some preferred embodiments, the selecting of an exon tag sequence comprises the 
steps of: a) identifying a genie sequence that is specific to the chromosome in the subject, 
and that is unique in the genome of the species of the subject; b) identifying an exon tag 

30 sequence within the genie sequence, wherein the exon tag sequence is compared to the 

genome to determine that the exon tag sequence is unique within the genome of the species 
of the subject. 
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In some preferred embodiments, the method further comprises providing an internal 
control and a non-amplifying oligonucleotide detection assay configured to detect the 
internal control, wherein the internal control target is detected using the non-amplifying 
oligonucleotide detection assay configured to detect the internal control. In some 
5 particularly preferred embodiments, the internal control comprises a sequence from a gene 
on chromosome 1 . 

In some embodiments of the methods of the present invention, the chromosome in a 
subject is selected from the group consisting of chromosomes 13, 18, 21, X and Y. In some 
embodiments of the methods of the present invention, the exon tag sequence is contained in 

1 0 a sample type including, but not limited to amniocyte cells or cell culture, amniotic fluid, 
placental tissue (villi) obtained by CVS techniques, or other tissues/cells of embryonic 
origin (e.g., including, but not limited to, cystic hygroma fluid, fetal urine, fetal skin, and 
fetal blood). In some embodiments of methods of the present invention, DNA (is) isolated 
from maternal plasma or serum and fetal DNA present in the sample is assayed. 

1 5 In some embodiments of methods of the present invention, DNA (is) isolated from 

maternal plasma or serum and fetal DNA present in the sample is assayed. The present 
invention provides kits for the determination of aneuploidy. In some embodiments, kits of 
the present invention comprise a non-amplified oligonucleotide detection assay configured 
for detecting at least one exon tag. In some embodiments of kits, the non-amplified 

20 oligonucleotide detection assay comprises first and second oligonucleotides configured to 
form an invasive cleavage structure in combination with a target sequence comprising the at 
least one exon tag. In some preferred embodiments, the first oligonucleotide comprises a 5* 
portion and a 3' portion, wherein the 3* portion is configured to hybridize to the target 
sequence, and wherein the 5 f portion is configured to not hybridize to the target sequence. 

25 In some particularly preferred embodiments, the second oligonucleotide comprises a 5' 
portion and a 3' portion, wherein the 5' portion is configured to hybridize to the target 
sequence, and wherein the 3' terminal nucleotide is configured to hybridize or not hybridize 
to the target sequence. 

In some preferred embodiments, the kit of the present invention is configured to 

30 detect an exon tag from a gene on a targeted chromosome (13, 18, 21, X and Y) including, 
but not limited to, DSCR9, DLEU1, FLJ23403, PFKFB1, NRIP1, SRY, PCDH9, CN2, 
PRKY, HLCS, MTMR8, FLJ21 174, or PCTK1. In still other embodiments, the kit of the 
present invention comprises an internal control. In particularly preferred embodiments, the 
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internal control comprises a sequence from genes on chromosome 1 (e.g., ACTA1 and 
HIST2HBE). 

DEFINITIONS 

5 To facilitate an understanding of the present invention, a number of terms and 

phrases are defined below: 

The term "gene dosage" as used herein refers to the copy number of a gene, a genie 
region, a chromosome, or fragments or portions thereof. Normal individuals carry two 
copies of most genes or genie regions, one on each of two chromosomes. However, there 
10 are certain exceptions, e.g., when genes or genie regions reside on the X or Y chromosomes, 
or when genes sequences are present in pseudogenes. 

The term "aneuploidy" as used herein refers to conditions wherein cells, tissues, or 
individuals have one or more whole chromosomes or segments of chromosomes either 
absent, or in addition to the normal euploid complement of chromosomes. 
15 The term "gene" refers to a DNA sequence that comprises control and coding 

sequences necessary for the production of an RNA having a non-coding function {e.g., a 
ribosomal or transfer RNA), a polypeptide or a precursor. The RNA or polypeptide can be 
encoded by a full length coding sequence or by any portion of the coding sequence so long 
as the desired activity or function is retained. 
20 The term "genie region" as used herein refers to a gene, its exons, its introns, and its 

regions flanking it upstream and downstream, e.g., 5 tolO kilobases 5' and 3* of the 
transcription start and stop sites, respectively. 

The term "genie sequence" as used herein refers to the sequence of a gene, its 
introns, and its regions flanking it upstream and downstream, e.g., 5 to 10 kilobases 5' and 3' 
25 of the transcription start and stop sites, respectively. 

The term "chromosome-specific" as used herein refers to a sequence that is found 
only in that particular type of chromosome. 

The term "exon tag" as used herein refers to a chromosome-specific sequence in the 
exon of a gene that is also unique in the genome. 
30 As used herein, the terms "subject" and "patient" refer to any organisms including 

plants, microorganisms and animals (e.g., mammals such as dogs, cats, livestock, and 
humans). 
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As used herein, the term "INVADER assay reagents" refers to one or more reagents 
for detecting target sequences, said reagents comprising oligonucleotides capable of 
forming an invasive cleavage structure in the presence of the target sequence. In some 
embodiments, the INVADER assay reagents further comprise an agent for detecting the 
5 presence of an invasive cleavage structure (e.g., a cleavage agent). In some embodiments, 
the oligonucleotides comprise first and second oligonucleotides, said first oligonucleotide 
comprising a 5 r portion complementary to a first region of the target nucleic acid and said 
second oligonucleotide comprising a 3' portion and a 5* portion, said 5' portion 
complementary to a second region of the target nucleic acid downstream of and contiguous 

10 to the first portion. In some embodiments, the 3' portion of the second oligonucleotide 
comprises a 3* terminal nucleotide not complementary to the target nucleic acid. In 
preferred embodiments, the 3' portion of the second oligonucleotide consists of a single 
nucleotide not complementary to the target nucleic acid. 

In some embodiments, INVADER assay reagents are configured to detect a target 

15 nucleic acid sequence comprising first and second non-contiguous single-stranded regions 
separated by an intervening region comprising a double-stranded region. In preferred 
embodiments, the INVADER assay reagents comprise a bridging oligonucleotide capable of 
binding to said first and second non-contiguous single-stranded regions of a target nucleic 
acid sequence. In particularly preferred embodiments, either or both of said first or said 

20 second oligonucleotides of said INVADER assay reagents are bridging oligonucleotides. 

In some embodiments, the INVADER assay reagents further comprise a solid 
support. For example, in some embodiments, the one or more oligonucleotides of the assay 
reagents (e.g., first and/or second oligonucleotide, whether bridging or non-bridging) is 
attached to said solid support. In some embodiments, the INVADER assay reagents further 

25 comprise a buffer solution. In some preferred embodiments, the buffer solution comprises a 
source of divalent cations (e.g., Mn2+ and/or Mg2+ ions). Individual ingredients (e.g., 
oligonucleotides, enzymes, buffers, target nucleic acids) that collectively make up 
INVADER assay reagents are termed "INVADER assay reagent components". 

In some embodiments, the INVADER assay reagents further comprise a third 

30 oligonucleotide complementary to a third portion of the target nucleic acid upstream of the 
first portion of the first target nucleic acid. In yet other embodiments, the INVADER assay 
reagents further comprise a target nucleic acid. In some embodiments, the INVADER assay 
reagents further comprise a second target nucleic acid. In yet other embodiments, the 



WO 2005/007869 



PCT/US2004/022014 



INVADER assay reagents further comprise a third oligonucleotide comprising a 5 f portion 
complementary to a first region of the second target nucleic acid. In some specific 
embodiments, the 3' portion of the third oligonucleotide is covalently linked to the second 
target nucleic acid. In other specific embodiments, the second target nucleic acid further 
5 comprises a 5* portion, wherein the 5* portion of the second target nucleic acid is the third 
oligonucleotide. In still other embodiments, the INVADER assay reagents further comprise 
an ARRESTOR molecule (e.g., ARRESTOR oligonucleotide). 

In some preferred embodiments, the INVADER assay reagents further comprise 
reagents for detecting a nucleic acid cleavage product. In some embodiments, one or more 

10 oligonucleotides in the INVADER assay reagents comprise a label. In some preferred 

embodiments, said first oligonucleotide comprises a label. In other preferred embodiments, 
said third oligonucleotide comprises a label. In particularly preferred embodiments, the 
reagents comprise a first and/or a third oligonucleotide labeled with moieties that produce a 
fluorescence resonance energy transfer (FRET) effect. 

15 In some embodiments one or more the INVADER assay reagents may be provided 

in a predispensed format (i.e., premeasured for use in a step of the procedure without re- 
measurement or re-dispensing). In some embodiments, selected INVADER assay reagent 
components are mixed and predispensed together. In preferred embodiments, predispensed 
assay reagent components are predispensed and are provided in a reaction vessel (including 

20 but not limited to a reaction tube or a well, as in, e.g., a microtiter plate). In particularly 
preferred embodiments, predispensed INVADER assay reagent components are dried down 
(e.g., desiccated or lyophilized) in a reaction vessel. 

In some embodiments, the INVADER assay reagents are provided as a kit. As used 
herein, the term "kit" refers to any delivery system for delivering materials. In the context 

25 of reaction assays, such delivery systems include systems that allow for the storage, 
transport, or delivery of reaction reagents (e.g., oligonucleotides, enzymes, etc. in the 
appropriate containers) and/or supporting materials (e.g., buffers, written instructions for 
performing the assay etc.) from one location to another. For example, kits include one or 
more enclosures (e.g., boxes) containing the relevant reaction reagents and/or supporting 

30 . materials. As used herein, the term "fragmented kit" refers to delivery systems comprising 
two or more separate containers that each contains a subportion of the total kit components. 
The containers may be delivered to the intended recipient together or separately. For 
example, a first container may contain an enzyme for use in an assay, while a second 

10 
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container contains oligonucleotides. The term "fragmented kit" is intended to encompass 
kits containing Analyte specific reagents (ASR's) regulated under section 520(e) of the 
Federal Food, Drug, and Cosmetic Act,. but is not limited thereto. Indeed, any delivery 
system comprising two or more separate containers that each contains a subportion of the 
5 total kit components are included in the term "fragmented kit." In contrast, a "combined 
kit" refers to a delivery system containing all of the components of a reaction assay in a 
single container (e.g., in a single box housing each of the desired components). The term 
"kit" includes both fragmented and combined kits. 

In some embodiments, the present invention provides INVADER assay reagent kits 

10 comprising one or more of the components necessary for practicing the present invention. 
For example, the present invention provides kits for storing or delivering the enzymes 
and/or the reaction components necessary to practice an INVADER assay. The kit may 
include any and all components necessary or desired for assays including, but not limited to, 
the reagents themselves, buffers, control reagents {e.g., tissue samples, positive and negative 

15 control target oligonucleotides, etc.), solid supports, labels, written and/or pictorial 

instructions and product information, inhibitors, labeling and/or detection reagents, package 
environmental controls {e.g., ice, desiccants, etc.), and the like. In some embodiments, the 
kits provide a sub-set of the required components, wherein it is expected that the user will 
supply the remaining components. In some embodiments, the kits comprise two or more 

20 separate containers wherein each container houses a subset of the components to be 

delivered. For example, a first container {e.g., box) may contain an enzyme {e.g., structure 
specific cleavage enzyme in a suitable storage buffer and container), while a second box 
may contain oligonucleotides {e.g., INVADER oligonucleotides, probe oligonucleotides, 
control target oligonucleotides, etc.). 

25 The term "label" as used herein refers to any atom or molecule that can be used to 

provide a detectable (preferably quantifiable) effect, and that can be attached to a nucleic 
acid or protein. Labels include but are not limited to dyes; radiolabels such as 32 P; binding 
moieties such as biotin; haptens such as digoxgenin; luminogenic, phosphorescent or 
fluorogenic moieties; mass tags; and fluorescent dyes alone or in combination with moieties 

30 that can suppress or shift emission spectra by fluorescence resonance energy transfer 

(FRET). Labels may provide signals detectable by fluorescence, radioactivity, colorimetry, 
gravimetry, X-ray diffraction or absorption, magnetism, enzymatic activity, characteristics 
of mass or behavior affected by mass {e.g., MALDI time-of- flight mass spectrometry), and 

11 
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the like. A label may be a charged moiety (positive or negative charge) or alternatively, 
may be charge neutral. Labels can include or consist of nucleic acid or protein sequence, so 
long as the sequence comprising the label is detectable. 

As used herein, the term "distinct" in reference to signals refers to signals that can be 
5 differentiated one from another, e.g., by spectral properties such as fluorescence emission 
wavelength, color, absorbance, mass, size, fluorescence polarization properties, charge, etc., 
or by capability of interaction with another moiety, such as with a chemical reagent, an 
enzyme, an antibody, etc. 

As used herein, the terms "complementary" or "complementarity" are used in 

10 reference to polynucleotides (i.e., a sequence of nucleotides such as an oligonucleotide or a 
target nucleic acid) related by base-pairing rules. For natural bases, the base pairing rules 
are generally those developed by Watson and Crick. For example, for the sequence " 5 f -A- 
G-T-3V is complementary to the sequence " 3'-T-C-A-5\" For non-natural bases, base- 
pairing rules include the formation of hydrogen bonds in a manner similar to the Watson- 

1.5 Crick base pairing rules (for example, having similar features such as geometries or bond 
angles, as described, e.g., by Kunkel, et al, Annu Rev Biochem. 69:497-529 (2000), 
incorporated herein by reference). Complementarity may be "partial," in which only some 
of the nucleic acids' bases are matched according to base pairing rules. Or, there may be 
"complete" or "total" complementarity between the nucleic acids. The degree of 

20 complementarity between nucleic acid strands has significant effects on the efficiency and 
strength of hybridization between nucleic acid strands. This is of particular importance in 
amplification reactions, as well as detection methods that depend upon binding between 
nucleic acids. Either term may also be used in reference to individual nucleotides, 
especially within the context of polynucleotides. For example, a particular nucleotide 

25 within an oligonucleotide may be noted for its complementarity, or lack thereof, to a 
nucleotide within another nucleic acid strand, in contrast or comparison to the 
complementarity between the rest of the oligonucleotide and the nucleic acid strand. 
Nucleotide analogs used to form non-standard base pairs, whether with another nucleotide 
analog (e.g., an IsoC/IsoG base pair), or with a naturally occurring nucleotide (e.g., as 

30 described in U.S. Patent 5,912,340, herein incorporated by reference in its entirety) are also 
considered to be complementary to a base pairing partner within the meaning this definition. 
Further, when nucleotides are known to form pairs with multiple different bases, e.g., the 
IsoG nucleotide's ability to pair with IsoC and with T nucleotides, each of the bases with 

12 
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which it can form a hydrogen-bonded base-pair falls within the meaning of 
"complementary," as used herein. "Universal" bases, i.e., those that can form base pairs 
with several other bases, such as the "wobble" base inosine, are considered complementary 
to those bases with which pairs can be formed. 
5 The complement of a nucleic acid sequence as used herein refers to an 

oligonucleotide which, when aligned with the nucleic acid sequence such that the 5' end of 
one sequence is paired with the 3' end of the other, is in "antiparallel association." Certain 
bases not commonly found in natural nucleic acids may be included in the nucleic acids of 
the present invention and include, for example, inosine and 7-deazaguanine. 

1 0 Complementarity need not be perfect; stable duplexes may contain mismatched base pairs 
or unmatched bases. Those skilled in the art of nucleic acid technology can determine 
duplex stability empirically considering a number of variables including, for example, the 
length of the oligonucleotide, base composition and sequence of the oligonucleotide, ionic 
strength and incidence of mismatched base pairs. 

1 5 The term "homology" and "homologous" refers to a degree of identity. There may 

be partial homology or complete homology. A partially homologous sequence is one that is 
less than 100% identical to another sequence. 

As used herein, the term "hybridization" is used in reference to the pairing of 
complementary nucleic acids. Hybridization and the strength of hybridization (i.e., the 

20 strength of the association between the nucleic acids) is influenced by such factors as the 
degree of complementary between the nucleic acids, stringency of the conditions involved, 
and the T m of the formed hybrid. "Hybridization" methods involve the annealing of one 
nucleic acid to another, complementary nucleic acid, i.e., a nucleic acid having a 
complementary nucleotide sequence. The ability of two polymers of nucleic acid 

25 containing complementary sequences to find each other and anneal through base pairing 
interaction is a well-recognized phenomenon. The initial observations of the 
"hybridization" process by Marmur and Lane, Proc. Natl. Acad. Sci. USA 46:453 (1960) 
and Doty et al., Proc. Natl. Acad. Sci. USA 46:461 (1960) have been followed by the 
refinement of this process into an essential tool of modern biology. 

30 As used herein, the term " T m " is used in reference to the "melting temperature." 

The melting temperature is the temperature at which a population of double-stranded 
nucleic acid molecules becomes half dissociated into single strands. Several equations for 
calculating the T m of nucleic acids are well known in the art. As indicated by standard 
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references, a simple estimate of the T m value may be calculated by the equation: T m = 8 1 .5 
+ 0.4 1(% G + C), when a nucleic acid is in aqueous solution at 1 M NaCl (see e.g., 
Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid Hybridization 
(1985). Other references {e.g., Allawi, H.T. & SantaLucia, J., Jr. Thermodynamics and 
5 NMR of internal G.T mismatches in DNA. Biochemistry 36, 10581-94 (1997) include more 
sophisticated computations which take structural and environmental, as well as sequence 
characteristics into account for the calculation of T m . 

The term "wild-type" refers to a gene or a gene product that has the characteristics of 
that gene or gene product when isolated from a naturally occurring source. A wild-type 

10 gene is that which is most frequently observed in a population and is thus arbitrarily 

designated the "normal" or "wild-type" form of the gene. In contrast, the term "modified" 
,"mutant" or "polymorphic" refers to a gene or gene product which displays modifications in 
sequence and or functional properties (i.e., altered characteristics) when compared to the 
wild-type gene or gene product. It is noted that naturally-occurring mutants can be isolated; 

15 these are identified by the fact that they have altered characteristics when compared to the 
wild-type gene or gene product. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired heterologous sequence. For example, although the term is not limited 
to the use of expressed sequences or sequences that encode an expression product, in some 

20 embodiments, the heterologous sequence is a coding sequence and appropriate DNA 

sequences necessary for either the replication of the coding sequence in a host organism, or 
the expression of the operably linked coding sequence in a particular host organism. DNA 
sequences necessary for expression in prokaryotes include a promoter, optionally an 
operator sequence, a ribosome binding site and possibly other sequences. Eukaryotic cells 

25 are known to utilize promoters, polyadenlyation signals and enhancers. 

The term "oligonucleotide" as used herein is defined as a molecule comprising two 
or more deoxyribonucleotides or ribonucleotides, preferably at least 5 nucleotides, more 
preferably at least about 10-15 nucleotides and more preferably at least about 15 to 30 
nucleotides. The exact size will depend on many factors, which in turn depend on the 

30 ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in 
any manner, including chemical synthesis, DNA replication, reverse transcription, PCR, or 
a combination thereof. 
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Because mononucleotides are reacted to make oligonucleotides in a manner such 
that the 5* phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its 
neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the "5* end" if its 5* phosphate is not linked to the 3* oxygen of a 
5 mononucleotide pentose ring and as the "3' end" if its 3' oxygen is not linked to a 5* 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oligonucleotide, also may be said to have 5' and 3' 
ends. A first region along a nucleic acid strand is said to be upstream of another region if 
the 3' end of the first region is before the 5' end of the second region when moving along a 

10 strand of nucleic acid in a 5' to 3* direction. 

When two different, non-overlapping oligonucleotides anneal to different regions of 
the same linear complementary nucleic acid sequence, and the 3 f end of one oligonucleotide 
points towards the 5' end of the other, the former may be called the "upstream" 
oligonucleotide and the latter the "downstream" oligonucleotide. Similarly, when two 

15 overlapping oligonucleotides are hybridized to the same linear complementary nucleic acid 
sequence, with the first oligonucleotide positioned such that its 5' end is upstream of the 5' 
end of the second oligonucleotide, and the 3* end of the first oligonucleotide is upstream of 
the 3' end of the second oligonucleotide, the first oligonucleotide may be called the 
"upstream" oligonucleotide and the second oligonucleotide may be called the "downstream" 

20 oligonucleotide. 

The term "primer" refers to an oligonucleotide that is capable of acting as a point of 
initiation of synthesis when placed under conditions in which primer extension is initiated. 
An oligonucleotide "primer" may occur naturally, may be made using molecular biological 
methods, e.g., purification of a restriction digest, or may be produced synthetically. 

25 A primer is selected to be "substantially" complementary to a strand of specific 

sequence of the template. A primer must be sufficiently complementary to hybridize with a 
template strand for primer elongation to occur. A primer sequence need not reflect the 
exact sequence of the template. For example, a non-complementary nucleotide fragment 
may be attached to the 5' end of the primer, with the remainder of the primer sequence being 

30 substantially complementary to the strand. Non-complementary bases or longer sequences 
can be interspersed into the primer, provided that the primer sequence has sufficient 
complementarity with the sequence of the template to hybridize and thereby form a template 
primer complex for synthesis of the extension product of the primer. 
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The term "cleavage structure" as used herein, refers to a structure that is formed by 
the interaction of at least one probe oligonucleotide and a target nucleic acid, forming a 
structure comprising a duplex, the resulting structure being cleavable by a cleavage means, 
including but not limited to an enzyme. The cleavage structure is a substrate for specific 
5 cleavage by the cleavage means in contrast to a nucleic acid molecule that is a substrate for 
non-specific cleavage by agents such as phosphodiesterases which cleave nucleic acid 
molecules without regard to secondary structure (i.e., no formation of a duplexed structure 
is required). 

The term "cleavage means" or "cleavage agent" as used herein refers to any means 

10 that is capable of cleaving a cleavage structure, including but not limited to enzymes. 

"Structure-specific nucleases" or "structure-specific enzymes" are enzymes that recognize 
specific secondary structures in a nucleic molecule and cleave these structures. The 
cleavage means of the invention cleave a nucleic acid molecule in response to the formation 
of cleavage structures; it is not necessary that the cleavage means cleave the cleavage 

1 5 structure at any particular location within the cleavage structure. 

The cleavage means may include nuclease activity provided from a variety of 
sources including the Cleavase enzymes, the FEN-1 endonucleases (including RAD2 and 
XPG proteins), Taq DNA polymerase and E, coli DNA polymerase I. The cleavage means 
may include enzymes having 5' nuclease activity (e.g., Taq DNA polymerase (DNAP), E. 

20 coli DNA polymerase I). The cleavage means may also include modified DNA 
polymerases having 5' nuclease activity but lacking synthetic activity. Examples of 
cleavage means suitable for use in the method and kits of the present invention are provided 
in U.S. Patent Nos. 5,614,402; 5,795,763; 5,843,669; 6,090,606, 6,562,61 1, 6553,587; PCT 
Appln. Nos WO 98/23774; WO 02/070755; and WO01/90337, each of which is herein 

25 incorporated by reference it its entirety. 

The term "thermostable" when used in reference to an enzyme, such as a 5' nuclease, 
indicates that the enzyme is functional or active (i.e., can perform catalysis) at an elevated 
temperature, i.e., at about 55°C or higher. 

The term "cleavage products" as used herein, refers to products generated by the 

30 reaction of a cleavage means with a cleavage structure (i.e., the treatment of a cleavage 
structure with a cleavage means). 

The term "target nucleic acid" refers to a nucleic acid molecule containing a 
sequence that has at least partial complementarity with at least a probe oligonucleotide and 

16 



WO 2005/007869 



PCT/US2004/022014 



may also have at least partial complementarity with an INVADER oligonucleotide. The 
target nucleic acid may comprise single- or double-stranded DNA or RNA. 

The term "non-target cleavage product" refers to a product of a cleavage reaction 
that is not derived from the target nucleic acid. As discussed above, in the methods of the 
5 present invention, cleavage of the cleavage structure generally occurs within the probe 
oligonucleotide. The fragments of the probe oligonucleotide generated by this target 
nucleic acid-dependent cleavage are "non-target cleavage products." 

The term "probe oligonucleotide" refers to an oligonucleotide that interacts with a 
target nucleic acid to form a cleavage structure in the presence or absence of an INVADER 
10 oligonucleotide. When annealed to the target nucleic acid, the probe oligonucleotide and 
target form a cleavage structure and cleavage occurs within the probe oligonucleotide. 

The term "INVADER oligonucleotide" refers to an oligonucleotide that hybridizes 
to a target nucleic acid at a location near the region of hybridization between a probe and 
the target nucleic acid, wherein the INVADER oligonucleotide comprises a portion (e.g., a 
1 5 chemical moiety, or nucleotide — whether complementary to that target or not) that overlaps 
with the region of hybridization between the probe and target. In some embodiments, the 
INVADER oligonucleotide contains sequences at its 3' end that are substantially the same 
as sequences located at the 5' end of a probe oligonucleotide. 

The term "cassette" as used herein refers to an oligonucleotide or combination of 
20 oligonucleotides configured to generate a detectable signal in response to cleavage of a 
probe oligonucleotide in an INVADER assay. In preferred embodiments, the cassette 
hybridizes to a non-target cleavage product from cleavage of the probe oligonucleotide to 
form a second invasive cleavage structure, such that the cassette can then be cleaved. 

In some embodiments, the cassette is a single oligonucleotide comprising a hairpin 
25 portion (i.e., a region wherein one portion of the cassette oligonucleotide hybridizes to a 

second portion of the same oligonucleotide under reaction conditions, to form a duplex). In 
other embodiments, a cassette comprises at least two oligonucleotides comprising 
complementary portions that can form a duplex under reaction conditions. In preferred 
embodiments, the cassette comprises a label. In particularly preferred embodiments, 
30 cassette comprises labeled moieties that produce a fluorescence resonance energy transfer 
(FRET) effect. 

The term "substantially single-stranded" when used in reference to a nucleic acid 
substrate means that the substrate molecule exists primarily as a single strand of nucleic 
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acid in contrast to a double-stranded substrate which exists as two strands of nucleic acid 
which are held together by inter-strand base pairing interactions. 

As used herein, the phrase "non-amplified oligonucleotide detection assay" refers to 
a detection assay configured to detect the presence or absence of a particular polymorphism 
5 (e.g., SNP, repeat sequence, etc.) in a target sequence (e.g., genomic DNA) that has not 
been amplified (e.g., by PCR), without creating copies of the target sequence. A "non- 
amplified oligonucloetide detection assay" may, for example, amplify a signal used to 
indicate the presence or absence of a particular polymorphism in a target sequence, so long 
as the target sequence is not copied. 

10 As used herein, the phrase "non-amplifying oligonucleotide detection assay" refers 

to a detection assay configured to detect the presence or absence of a particular 
polymorphism (e.g., SNP, repeat sequence, etc.) in a target sequence (e.g., genomic DNA, 
or amplified or other synthetic DNA), without creating copies of the target sequence. A 
"non-amplifing oligonucloetide detection assay" may, for example, amplify a signal used to 

15 indicate the presence or absence of a particular polymorphism in a target sequence, so long 
as the target sequence is not copied. 

The term "sequence variation" as used herein refers to differences in nucleic acid 
sequence between two nucleic acids. For example, a wild-type structural gene and a mutant 
form of this wild-type structural gene may vary in sequence by the presence of single base 

20 substitutions and/or deletions or insertions of one or more nucleotides. These two forms of 
the structural gene are said to vary in sequence from one another. A second mutant form of 
the structural gene may exist. This second mutant form is said to vary in sequence from 
both the wild-type gene and the first mutant form of the gene. 

The term "liberating" as used herein refers to the release of a nucleic acid fragment 

25 from a larger nucleic acid fragment, such as an oligonucleotide, by the action of, for 

example, a 5' nuclease such that the released fragment is no longer covalently attached to 
the remainder of the oligonucleotide. 

The term "K m " as used herein refers to the Michaelis-Menten constant for an enzyme 
and is defined as the concentration of the specific substrate at which a given enzyme yields 

30 one-half its maximum velocity in an enzyme catalyzed reaction. 

The term "nucleotide analog" as used herein refers to modified or non-naturally 
occurring nucleotides including but not limited to analogs that have altered stacking 
interactions such as 7-deaza purines (i.e., 7-deaza-dATP and 7-deaza-dGTP); base analogs 
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with alternative hydrogen bonding configurations (e.g., such as Iso-C and Iso-G and other 
non-standard base pairs described in U.S. Patent No. 6,001,983 to S. Benner); non-hydrogen 
bonding analogs (e.g., non-polar, aromatic nucleoside analogs such as 2,4-difluorotoluene, 
described by B.A. Schweitzer and E.T. Kool, J. Org. Chem., 1994, 59, 7238-7242, B.A. 
5 Schweitzer and E.T. Kool, J. Am. Chem. Soc, 1995, 117, 1863-1872); "universal" bases 
such as 5-nitroindole and 3-nitropyrrole; and universal purines and pyrimidines (such as 
"K" and "P" nucleotides, respectively; P. Kong, et al, Nucleic Acids Res., 1989, 17, 10373- 
10383, P. Kong et ai, Nucleic Acids Res., 1992, 20, 5149-5152). Nucleotide analogs 
include base analogs, and comprise modified forms of deoxyribonucleotides as well as 

10 ribonucleotides, and include but are not limited to to modified bases and nucleotides 
described in U.S. Pat. Nos. 5,432,272; 6,001,983; 6,037,120; 6,140,496; 5,912,340; 
6,127,121 and 6,143,877, each of which is incorporated herein by reference in their 
entireties; heterocyclic base analogs based on the purine or pyrimidine ring systems, and 
other heterocyclic bases. 

1 5 The term "polymorphic locus" is a locus present in a population that shows variation 

between members of the population (e.g., the most common allele has a frequency of less 
than 0.95). In contrast, a "monomorphic locus" is a genetic locus at little or no variations 
seen between members of the population (generally taken to be a locus at which the most 
common allele exceeds a frequency of 0.95 in the gene pool of the population). 

20 The term "microorganism" as used herein means an organism too small to be 

observed with the unaided eye and includes, but is not limited to bacteria, virus, protozoans, 
fungi, and ciliates. 

The term "microbial gene sequences" refers to gene sequences derived from a 
microorganism. 

25 The term "bacteria" refers to any bacterial species including eubacterial and 

archaebacterial species. 

The term "virus" refers to obligate, ultramicroscopic, intracellular parasites 
incapable of autonomous replication (i.e., replication requires the use of the host cell's 
machinery). 

30 The term "multi-drug resistant" or multiple-drug resistant" refers to a microorganism 

that is resistant to more than one of the antibiotics or antimicrobial agents used in the 
treatment of said microorganism. 
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The term "sample" in the present specification and claims is used in its broadest 
sense. On the one hand it is meant to include a specimen or culture {e.g., microbiological 
cultures). On the other hand, it is meant to include both biological and environmental 
samples. A sample may include a specimen of synthetic origin. 
5 Biological samples may be animal, including human, fluid, solid (e.g., stool) or 

tissue, as well as liquid and solid food and feed products and ingredients such as dairy 
items, vegetables, meat and meat by-products, and waste. Biological samples may be 
obtained from all of the various families of domestic animals, as well as feral or wild 
animals, including, but not limited to, such animals as ungulates, bear, fish, lagamorphs, 
10 rodents, etc. 

Environmental samples include environmental material such as surface matter, soil, 
water and industrial samples, as well as samples obtained from food and dairy processing 
instruments, apparatus, equipment, utensils, disposable and non-disposable items. These 
examples are not to be construed as limiting the sample types applicable to the present 
15 invention. 

The term "source of target nucleic acid" refers to any sample that contains nucleic 
acids (RNA or DNA). Particularly preferred sources of target nucleic acids are biological 
samples including, but not limited to blood, saliva, cerebral spinal fluid, pleural fluid, milk, 
lymph, sputum and semen. 

20 An oligonucleotide is said to be present in "excess" relative to another 

oligonucleotide (or target nucleic acid sequence) if that oligonucleotide is present at a 
higher molar concentration that the other oligonucleotide (or target nucleic acid sequence). 
When an oligonucleotide such as a probe oligonucleotide is present in a cleavage reaction in 
excess relative to the concentration of the complementary target nucleic acid sequence, the 

25 reaction may be used to indicate the amount of the target nucleic acid present. Typically, 
when present in excess, the probe oligonucleotide will be present at least a 100-fold molar 
excess; typically at least 1 pmole of each probe oligonucleotide would be used when the 
target nucleic acid sequence was present at about 10 fmoles or less. 

A sample "suspected of containing" a first and a second target nucleic acid may 

30 contain either, both or neither target nucleic acid molecule. 

The term "reactant" is used herein in its broadest sense. The reactant can comprise, 
for example, an enzymatic reactant, a chemical reactant or light (e.g., ultraviolet light, 
particularly short wavelength ultraviolet light is known to break oligonucleotide chains). 
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Any agent capable of reacting with an oligonucleotide to either shorten (/.e., cleave) or 
elongate the oligonucleotide is encompassed within the term "reactant." 

As used herein, the term "purified" or "to purify" refers to the removal of 
contaminants from a sample. For example, recombinant CLEAVASE nucleases are 
expressed in bacterial host cells and the nucleases are purified by the removal of host cell 
proteins; the percent of these recombinant nucleases is thereby increased in the sample. 

As used herein the term "portion" when in reference to a protein (as in "a portion of 
a given protein") refers to fragments of that protein. The fragments may range in size from 
four amino acid residues to the entire amino acid sequence minus one amino acid (e.g., 4, 5, 
6,..., n-1). 

The term "nucleic acid sequence" as used herein refers to an oligonucleotide, 
nucleotide or polynucleotide, and fragments or portions thereof, and to DNA or RNA of 
genomic or synthetic origin that may be single or double stranded, and represent the sense 
or antisense strand. Similarly, "amino acid sequence" as used herein refers to peptide or 
protein sequence. 

As used herein, the terms "purified" or "substantially purified" refer to molecules, 
either nucleic or amino acid sequences, that are removed from their natural environment, 
isolated or separated, and are at least 60% free, preferably 75% free, and most preferably 
90% free from other components with which they are naturally associated. An "isolated 
polynucleotide" or "isolated oligonucleotide" is therefore a substantially purified 
polynucleotide. 

The term "continuous strand of nucleic acid" as used herein is means a strand of 
nucleic acid that has a continuous, covalently linked, backbone structure, without nicks or 
other disruptions. The disposition of the base portion of each nucleotide, whether 
base-paired, single-stranded or mismatched, is not an element in the definition of a 
continuous strand. The backbone of the continuous strand is not limited to the 
ribose-phosphate or deoxyribose-phosphate compositions that are found in naturally 
occurring, unmodified nucleic acids. A nucleic acid of the present invention may comprise 
modifications in the structure of the backbone, including but not limited to phosphorothioate 
residues, phosphonate residues, T substituted ribose residues (e.g. 9 2'-0-methyl ribose) and 
alternative sugar (e.g., arabinose) containing residues. 

The term "continuous duplex" as used herein refers to a region of double stranded 
nucleic acid in which there is no disruption in the progression of basepairs within the duplex 
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(i.e., the base pairs along the duplex are not distorted to accommodate a gap, bulge or 
mismatch with the confines of the region of continuous duplex). As used herein the term 
refers only to the arrangement of the basepairs within the duplex, without implication of 
continuity in the backbone portion of the nucleic acid strand. Duplex nucleic acids with 
5 uninterrupted basepairing, but with nicks in one or both strands are within the definition of a 
continuous duplex. 

The term "duplex" refers to the state of nucleic acids in which the base portions of 
the nucleotides on one strand are bound through hydrogen bonding the their complementary 
bases arrayed on a second strand. The condition of being in a duplex form reflects on the 

10 state of the bases of a nucleic acid. By virtue of base pairing, the strands of nucleic acid 
also generally assume the tertiary structure of a double helix, having a major and a minor 
groove. The assumption of the helical form is implicit in the act of becoming duplexed. 

The term "template" refers to a strand of nucleic acid on which a complementary 
copy is built from nucleoside triphosphates through the activity of a template-dependent 

1 5 nucleic acid polymerase. Within a duplex the template strand is, by convention, depicted 
and described as the "bottom" strand. Similarly, the non-template strand is often depicted 
and described as the "top" strand. 

DESCRIPTION OF THE DRAWINGS 

20 The following figures form part of the present specification and are included to 

further demonstrate certain aspects and embodiments of the present invention. 
Figure 1 shows a general overview of the biplex INVADER assay. 
Figure 2 shows a general overview of the EXON TAGGER program. 
Figure 3 shows a list of design regions and oligonucleotide designs for INVADER 
25 assays for aneuploidy. All oligonucleotide sequences are shown in the 5'-3' orientation and 
all probes contain 3' hexanediol. GC content refers to the GC content of the target 91 mer. 

Figure 4 illustrates determination of the limit of detection (LOD) of assays to detect 
chromosome 21. 

Figure 5 illustrates detection of discrete regions of chromosome 13. 
30 Figure 6 illustrates detection of discrete regions of chromosome 18. ' 

Figure 7A-F illustrates detection of discrete regions of chromosome 21. 
Figure 8 illustrates detection of discrete regions of the X chromosome. 
Figure 9 illustrates detection of discrete regions of the Y chromosome. 
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Figure 10 shows a comparison of detection of the DSCR6 gene by two different 
probe sets. 

Figure 1 1 shows the results of an experiment to mimic detection of a trisomy 21 
sample contaminated with varying levels of normal, disomy DNA. 
5 Figure 12 shows a list of target regions for INVADER assays for aneuploidy. The 

INVADER assay footprint is indicated in parenthesis and the cleavage site is designated in 
brackets. 

Figure 13 illustrates determination of the limit of detection (LOD) of assays to detect 
the X chromosome. Figure 13A shows detection of PFKFB1+PCTK1 targets and Figure 
10 1 3B shows detection of MTMR8+FLJ2 1 1 74 targets. 

Figure 14 shows the results of the INVADER assay for detection of chromosome 18 
targets in samples of mixed content. 

Figure 15 shows the results of INVADER assay detection of triploidy samples. 

1 5 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides methods for measuring the copy number of a specific 
polynucleotide sequence in a biological sample. In general, the present invention involves 
homogeneous detection of gene dosage in a single reaction vessel. In some embodiments, 
gene dosage is measured in a single reaction vessel. In some embodiments, accumulation of 

20 target-specific signal is directly correlated to the amount of the specific polynucleotide 
sequence present in the sample. 

In some embodiments, the methods of the present invention involve direct detection 
of a test polynucleotide sequence and of a control sequence. In some embodiments, the 
number of copies of a normal individual will be two, and gene dosage determinations that 

25 deviate from two will be deemed aberrant. 

In some preferred embodiments, the methods of the present invention involve the 
INVADER assay. The INVADER assay detects specific DNA and RNA sequences by 
using structure-specific enzymes (e.g., FEN endonucleases) to cleave a complex formed by 
the hybridization of overlapping oligonucleotide probes (See, e.g., Figure 1). When two 

30 strands of nucleic acid, or oligonucleotides, both hybridize to a target nucleic acid strand 

such that they form an overlapping invasive cleavage structure, as described below, invasive 
cleavage can occur. Through the interaction of a cleavage agent (e.g., a 5' nuclease) and the 
upstream oligonucleotide, the cleavage agent can be made to cleave the downstream 
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oligonucleotide at an internal site in such a way that a distinctive fragment is produced. 
Such embodiments have been termed the INVADER assay (Third Wave Technologies) and 
are described in U.S. Patent Appl. Nos. 5,846,717, 5,985,557, 5,994,069, 6,001,567, and 
6,090,543, WO 97/27214 WO 98/42873, Lyamichev et al., Nat. Biotech., 17:292 (1999), 
5 Hall et al., PNAS, USA, 97:8272 (2000), each of which is herein incorporated by reference 
in their entirety for all purposes). In preferred embodiments, elevated temperature and an 
excess of one of the probes enable multiple probes to be cleaved for each target sequence 
present without temperature cycling. The resulting cleavage products are indicative of the 
presence of specific target nucleic acid sequences in the sample. The reactions can be 
0 configured such that the amount of the cleavage product produced indicates the amount of 
the target sequence present in the reaction. Lyamichev, et al, Nature Biotech 1999, supra. 

The INVADER assay detects hybridization of probes to a target by enzymatic 
cleavage of specific structures by structure specific enzymes (See, INVADER assays, Third 
Wave Technologies; See e.g., U.S. Patent Nos. 5,846,717; 6,090,543; 6,001,567; 5,985,557; 
5 6,090,543; 5,994,069; Lyamichev et al., Nat. Biotech., 17:292 (1999), Hall et al., PNAS, 
USA, 97:8272 (2000), W097/27214 and W098/42873, each of which is herein 
incorporated by reference in their entirety for all purposes). 

The INVADER assay can be configured to detect gene dosage or specific mutations 
and SNPs in unamplified, as well as amplified, RNA and DNA including genomic DNA. In 
) the embodiments shown schematically in Figure 1 , the INVADER assay uses two cascading 
steps (e.g., a primary and a secondary reaction) to generate and then to amplify the target- 
specific signal. For convenience, the targets in the following discussion are described as 
Target 1 and Target 2, even though this terminology does not apply to all genetic variations. 
For example, in some embodiments, Target 1 is a wild type form of a gene and Target 2 is a 
variant or mutant form of a gene. In other embodiments, Target 1 and Target 2 are genes 
present in different copy numbers in different samples. In the primary reaction (Figure 1), 
the Target 1 primary probe and the INVADER oligonucleotide hybridize in tandem to the 
target nucleic acid to form an overlapping structure. An unpaired "flap" is included on the 
5" end of the Target 1 primary probe. A structure-specific enzyme (e.g., a CLEAVASE 
enzyme, Third Wave Technologies) recognizes the overlap and cleaves off the unpaired 
flap, releasing it as a target-specific product. In the secondary reaction, this cleaved product 
serves as an INVADER oligonucleotide on the Target 1 fluorescence resonance energy 
transfer (WT-FRET) probe to again create the structure recognized by the structure specific 
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enzyme (panel A). When the two dyes on a single FRET probe are separated by cleavage 
(indicated by the arrow in Figure 1), a detectable fluorescent signal above background 
fluorescence is produced. Consequently, cleavage of this second structure results in an 
increase in fluorescence, indicating the presence of the Target 1 allele. In some 
5 embodiments, FRET probes having different labels (e.g., resolvable by difference in 

emission or excitation wavelengths, or resolvable by time-resolved fluorescence detection) 
are provided for each allele or locus to be detected, such that the different alleles or loci can 
be detected in a single reaction. In such embodiments, the primary probe sets and the 
different FRET probes maybe combined in a single assay, allowing comparison of the 
0 signals from each allele or locus in the same sample. 

Figure 1 shows one illustrative embodiments of the INVADER assay where wild 
type and mutant alleles are detected. In the embodiment shown in Figure 1, the primary 
INVADER assay reaction is directed against the target DNA (or RNA) being detected. 
Generally, the target DNA is the limiting component in the first invasive cleavage, since the 
5 INVADER and primary probe are generally supplied in molar excess. In the second 

invasive cleavage, it is the released flap that is limiting. When these two cleavage reactions 
are performed sequentially, the fluorescence signal from the composite reaction 
accumulates exponentially, and quantitatively reflects the amount of target DNA amount. 
Hall, et al., PNAS 2000, supra. 
0 In the embodiment shown in Figure 1 , if the primary probe oligonucleotide and the 

target nucleotide sequence do not match (complement) at the cleavage site (e.g., as with the 
MT primary probe and the WT target, Figure 1, panel B), the overlapped structure does not 
form and cleavage is suppressed. The structure-specific enzyme (e.g., CLEAVASE VDI 
enzyme, Third Wave Technologies) cleaves the overlapped structure more efficiently (e.g., 
at least 340-fold) than the non-overlapping structure, allowing excellent discrimination of 
the alleles. 

The assay is generally configured such that the probes turn over without temperature 
cycling to produce many signals per target (i.e., linear signal amplification). Similarly, each 
target-specific product can enable the cleavage of many FRET probes. In alternative 
embodiments, the assay can be configured to use temperature cycling to facilitate probe 
turnover. 

In certain embodiments, the INVADER assay, or other nucleotide detection assays, 
are performed with oligonucleotides selected to detect sites on a target strand that have been 
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found to be particularly accessible. In some embodiments, the INVADER assays are 
performed using one or more structure bridging oligonucleotides. Such methods, 
procedures and compositions are described in U.S. Pat. 6,194,149, WO9850403, and 
W0O198537, all of which are specifically incorporated by reference in their entireties. 
5 In certain embodiments, the target nucleic acid sequence is amplified prior to 

detection (e.g., such that synthetic nucleic acid is generated). In some embodiments, the 
target nucleic acid comprises genomic DNA. In other embodiments, the target nucleic acid 
comprises synthetic DNA or RNA. In some preferred embodiments, synthetic DNA within 
a sample is created using a purified polymerase. In some preferred embodiments, creation 

1 0 of synthetic DNA using a purified polymerase comprises the use of PCR. In other preferred 
embodiments, creation of synthetic DNA using a purified DNA polymerase, suitable for use 
with the methods of the present invention, comprises use of rolling circle amplification, 
(e.g., as in U.S. Pat. Nos. 6,210,884, 6,183,960 and 6,235,502, herein incorporated by 
reference in their entireties). In other preferred embodiments, creation of synthetic DNA 

1 5 comprises copying genomic DNA by priming from a plurality of sites on a genomic DNA 
sample. In some embodiments, priming from a plurality of sites on a genomic DNA sample 
comprises using short (e.g., fewer than about 8 nucleotides) oligonucleotide primers. In 
other embodiments, priming from a plurality of sites on a genomic DNA comprises 
extension of 3' ends in nicked, double-stranded genomic DNA (i.e., where a 3' hydroxyl 

20 group has been made available for extension by breakage or cleavage of one strand of a 
double stranded region of DNA). Some examples of making synthetic DNA using a 
purified polymerase on nicked genomic DNAs, suitable for use with the methods and 
compositions of the present invention, are provided in U.S. Patent Nos. 6,1 17,634, issued 
September 12, 2000, and 6,197,557, issued March 6, 2001, and in PCT application WO 

25 98/39485, each incorporated by reference herein in their entireties for all purposes. 

In other embodiments, synthetic DNA suitable for use with the methods and 
compositions of the present invention is made using a purified polymerase on multiply- 
primed genomic DNA, as provided, e.g., in U.S. Patent Nos. 6,291,187, and 6,323,009, and 
in PCT applications WO 01/88190 and WO 02/00934, each herein incorporated by 

30 reference in their entireties for all purposes. In these embodiments, amplification of DNA 
such as genomic DNA is accomplished using a DNA polymerase, such as the highly 
processive $ 29 polymerase (as described, e.g., in US Patent Nos. 5,198,543 and 5,001,050, 



26 



WO 2005/007869 



PCT/US2004/022014 



each herein incorporated by reference in their entireties for all purposes) in combination 
with exonuclease-resistant random primers, such as hexamers. 

In certain embodiments, the present invention provides kits for assaying a pooled 
sample (e.g.. a pooled blood sample) using INVADER detection reagents (e.g., primary 
probe, INVADER probe, and FRET cassette). In preferred embodiments, the kit further 
comprises instructions on how to perform the INVADER assay and specifically how to 
apply the INVADER detection assay to pooled samples from many individuals, or to 
"pooled" samples from many cells (e.g., from a biopsy sample) from a single subject. 

The present invention further provides assays in which the target nucleic acid is 
reused or recycled during multiple rounds of hybridization with oligonucleotide probes and 
cleavage of the probes without the need to use temperature cycling (i.e., for periodic 
denaturation of target nucleic acid strands) or nucleic acid synthesis (i.e., for the 
polymerization-based displacement of target or probe nucleic acid strands). When a 
cleavage reaction is run under conditions in which the probes are continuously replaced on 
the target strand (e.g., through probe-probe displacement or through an equilibrium between 
probe/target association and disassociation, or through a combination comprising these 
mechanisms, [The kinetics of oligonucleotide replacement. Luis P. Reynaldo, Alexander V. 
Vologodskii, Bruce P. Neri and Victor I. Lyamichev. J. Mol. Biol. 97: 51 1-520 (2000)], 
multiple probes can hybridize to the same target, allowing multiple cleavages, and the 
generation of multiple cleavage products. 

In preferred embodiments, control and test samples to be compared in any given 
experiment are purified by the same method. In particularly preferred embodiments, DNA 
is quantified following purification, e.g. by PICOGREEN (Molecular Probes, Eugene, OR) 
assay or A 260 , and before analysis and comparable amounts of the appropriate controls and 
test samples are added to the respective assays. In particularly preferred embodiments, the 
amount of DNA added to a test sample is between 10-160 ng/20/d assay or 3-30ng/10/d 
assay. 

The INVADER assay is suitable for use with a variety of sample types. For 
example, in some embodiments of the present invention utilizing INVADER assay 
detection of gene dosage sample types include, but are not limited to, amniocyte cells, cystic 
hygroma fluid, amniocyte cell culture, amniotic fluid, chorionic villi, fetal urine, fetal skin, 
and fetal blood (See e.g., Donnenfeld and Lamb, Clin. Lab. Med. 23:457 [2003]; herein 
incorporated by reference). 
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In some particularly preferred embodiments, the selection of specific sequences for 
detection by, and the design of oligonucleotides for use in an INVADER assay is carried out 
using computational methods. In some embodiments, such methods for the design of 
oligonucleotides that successfully hybridize to appropriate regions of target nucleic acids 
5 under the desired reaction conditions (e.g., temperature, buffer conditions, etc.) for the 
detection assay. In some embodiments of the present invention, assay design is carried out 
using INVADERCREATOR software (Third Wave Technologies, Madison, Wis.), which 
calculates ideal oligonucleotide sequences and reaction conditions for conducting invasive 
cleavage reactions, e.g., as described in US Patent Applications Ser. No. 09/864,636 and 

10 10/336,446, each incorporated herein in its entirety for all purposes. 

In some embodiments, a multistep computational approach is applied to identify 
sequences that are unique in the human genome. In some preferred embodiments, some of 
the steps of a computational approach involve the creation of integrated databases of human 
genomic sequences or sequence variations. Variation in the human genome sequence 

1 5 accounts for a large fraction of observed differences between individuals, and it has been 
established that common genetic variants underlie susceptibility to many diseases as well as 
response to therapeutic treatments. To date over 10 million variations have been submitted 
to over 8 databases in the public domain. 

While the number and the mapping of genes and sequence variations differs between 

20 public databases, integrating these data points and applying computational algorithms 

creates a platform to enable the detection of sequences without known variation which mark 
gene regions. Gene regions may encompass both known or predicted regulatory sites, 
exons, splice sites and other functional regions. In order to identify these important regions, 
a multi-step, iterative computational approach can be applied. For example, a program 

25 termed EXON TAGGER was designed to identify marker sequences for regions within an 
exon, which can be used to identify either the complete exon or a part of it. One 
embodiment of this approach is schematized in Figure 2. 

In a preferred embodiment, the EXON TAGGER computational schema begins with 
a curated collection of known genes and sequence variations that are integrated into an 

30 internal database and mapped to the current assembly of the human genome. The database 
integrates data sets from diverse sources by applying data normalization techniques, 
establishing a common vocabulary for labeling data elements, and resolving data 
inconsistencies through manual curation. The database provides the platform for analyzing 
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gene regions and sequence variants in gene regions. In one particularly preferred 
embodiment, NCBI's collection of Reference Sequences (RefSeq) serves as the primary 
source of genes. Secondary sources may include proprietary sequences, computational 
prediction of coding regions using gene prediction programs, and published literature. 
5 Several sources of genetic variation are integrated and include NCBI's database of SNPs 
(dbSNP), the database of Japanese single nucleotide polymorphisms (JSNP), Human 
Genome Variation Database (HGVbase), and variants identified in published literature and 
public databases with a focus on pharmacogenomic variants. The primary source for 
genome assembly and the accompanying annotation are from the University of California 

1 0 Santa Cruz GoldenPath (UCSC GoldenPath). 

In some preferred embodiments, the EXON TAGGER computational schema is 
designed to accommodate several empirically determined and experimentally significant 
variables. These factors include, sequence variation within the exon tag, uniqueness of the 
tag across the genome, and the suitability of any given region for analysis by a nucleic acid- 

15 based assay, e.g. the INVADER assay, AS-PCR, or TaqMan. The computational algorithm 
relies upon an iterative process to identify a marker sequence that will uniquely identify an 
exon by utilizing a dynamic programming approach. The algorithm also filters marker 
sequences to identify the "best" candidates for development of a molecular assay by 
considering the presence of sequence variants and several local sequence content elements. 

20 The local sequence content examined includes the presence of repetitive elements and 

simple repeats, and GC content. Additional analysis using methods known in the art can be 
applied to identify the presence of pseudogenes, gene duplications, and other homologies 
that may interfere with the interpretation of the molecular assay results. 

25 Selecting Sequences to Represent Genes: 

In some embodiments, it is preferred to obtain sequences with the most annotation. 
In other embodiments, RefSeq sequences are used to identify genes. In still further 
embodiments, it is desired to identify exonic regions. RefSeq sequences are curated mRNA 
sequences which identify the exon start and stop sites of various genes according to the 
30 current assembly, in addition to providing sequence annotation identifying the untranslated 
region (UTR) sites, coding sequence (CDS) start sites and any other annotation associated 
with the mRNA sequence. At times, more than one reference sequence may exist for a 
given gene. In this case, there are multiple splice forms of the gene that are known to exist. 
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In some embodiments, to get the most complete form of the gene, the RefSeq sequence that 
represents the longest sequence is identified by summing the exon start and stop sites listed 
for the RefSeq sequence. In some embodiments, if there are multiple RefSeq Sequences, 
the RefSeq sequence with the lowest RefSeq ID is chosen to represent the gene. In other 
5 embodiments, all RefSeq splice variants are entered into the exon tagger program and 
duplicate exon tags are filtered at the end. Duplicate tags between RefSeq entries are 
removed by removing tags that have 100% identity to another tag (e.g., via pairwise 
comparison). In the case of duplicate tags, the tag with the lowest RefSeq id is retained. By 
finding the unique tags, sequences that may be in one splice variant but not another, are not 

1 0 lost by filtering arbitrarily by sequence size. This anchors the rule set back to the actual 
sequence). This RefSeq sequence is the foundation for the analysis and provides an 
annotation to find the sequence for each exon in the gene. Candidate genes are identified by 
their approved gene symbol, LocusLink identifier, reference mRNA accession (NM_#), or 
reference protein accession (NP_#). Selecting the size of the exon tag: 

1 5 The size of exons within a gene is highly variable and may be many nucleotides in 

length or just a few. In a preferred embodiment, exon tags are 91 base pairs in length. If 
the exon is shorter than 91 base pairs in length the program resets the exon tag size to be the 
same as the size of the exon. The desired size of the exon tag can be set as a parameter in 
the EXON TAGGER program to be any given length. Once the exon tag size is 

20 determined, a set of tags for the targeted exon is created. These tags are then checked to see 
if they occur uniquely in the genome and do not appear to have any sequence features that 
may make them problematic assays (see sections below). If no appropriate exon tags are 
found, the program resets the exon tag size to the current exon tag size minus 10 and re-tests 
the new tags. This process continues until the exon tag size is less than 50bp. If no 

25 appropriate exon tags are found for the exon or the exon size is less than 50 base pairs, 
potential exon tags are examined by hand. 

Creating exon tags: 

Exon tags are created by getting the sequence for the exon from our integrated 
30 database. Once the complete sequence for the exon is acquired, the program steps down the 
exonic sequence to create exon tags using the selected exon tag size. Once an exon tag is 
created the program steps down the exonic sequence by a specified number of bases (default 
= 5bp) to create the next exon tag. 
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Finding a Unique Tag Sequence for an Exon: 

In some cases, RefSeq sequences may map to multiple regions within the genome, 
e.g. when pseudogenes are present, when a given sequence is found to have multiple 
5 mappings in genome assembly or partial homology to other regions or members of the same 
gene family, or may be otherwise incompletely assembled. If significant homology to 
multiple genomic regions is detected, (e.g., if a gene is duplicated in the genome), the exon 
tags for the gene may require PCR to amplify the region of interest before testing for each 
exon tag. Alternatively, in some embodiments, it may be desirable to target such duplicated 

10 sequences in order to increase signal generated from a discrete assay. In some embodiments, 
the EXON TAGGER algorithm can be directed to eliminate candidate exon tags that may 
contain repetitive element and therefore may be duplicated in the genome. In other 
embodiments, such repetitive elements may be retained. In some embodiments, candidate 
exon tags are compared against the current assembly of the genome to verily that each of 

15 the candidate tags appears only the anticipated number of times in the genome. 

Checking each Candidate Exon Tag for Assay Ability: 

In order to provide the best possible exon tags for each exon, the EXON TAGGER 
program looks for strings of G's or C's and SNPs within the tag sequence. If the exon tag 

20 sequence is found to have SNPs or strings of 5 or more G's or C's within the 40bp around 
the center of the exon tag, the candidate exon tag sequence is removed. In addition, the 
program can be designed to measure GC content within the tag sequence. If desired, the 
number of G's or C's, as well as the overall GC content, can be modulated. In preferred 
embodiments, sequences that have more than 70% GC content or less than 20% GC content 

25 are removed. In particularly preferred embodiments, sequences having more than 60% GC 
content or less than 40% GC content are removed. 



EXPERIMENTAL EXAMPLES 

The following examples are provided to illustrate certain embodiments of the 
30 invention without being intended to limit the invention to the specific applications 
described. 

In the disclosure that follows, the following abbreviations apply: Ex. (Example); 
.Fig. (Figure);°C (degrees Centigrade); g (gravitational field); hr (hour); min (minute); olio 
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(oligonucleotide); rxn (reaction); vol (volume); w/v (weight to volume); v/v (volume to 
volume); BSA (bovine serum albumin); CTAB (cetyltrimethylammonium bromide); HPLC 
(high pressure liquid chromatography); DNA (deoxyribonucleic acid); p (plasmid); ml 
(microliters); ml (milliliters); mg (micrograms); mg (milligrams); M (molar); mM 
5 (milliMolar); mM (microMolar); pmoles (picomoles); amoles (attomoles); zmoles 

(zeptomoles); nm (nanometers); kdal (kilodaltons); OD (optical density); EDTA (ethylene 
diamine tetra-acetic acid); FITC (fluorescein isothiocyanate); SDS (sodium dodecyl 
sulfate); NaP04 (sodium phosphate); NP-40 (Nonidet P-40); Tris (tris(hydroxymethyl)- 
aminomethane); PMSF (phenylmethylsulfonylfluoride); TBE (Tris-Borate-EDTA, i.e., Tris 

10 buffer titrated with boric acid rather than HC1 and containing EDTA); PBS (phosphate 
buffered saline); PPBS (phosphate buffered saline containing 1 mM PMSF); PAGE 
(polyacrylamide gel electrophoresis); Tween (polyoxyethylene-sorbitan); Red (REDMOND 
RED Dye, Epoch Biosciences, Bothell WA) Z28 (ECLIPSE Quencher, Epoch Biosciences, 
Bothell, WA); ATCC (American Type Culture Collection, Rockville, MD); Coriell (Coriell 

15 Cell Repositories, Camden, NJ); DSMZ (Deutsche Sammlung von Mikroorganismen und 
Zellculturen, Braunschweig, Germany); Ambion (Ambion, Inc., Austin, TX); Boehringer 
(Boehringer Mannheim Biochemical, Indianapolis, IN); MJ Research (MJ Research, 
Watertown, MA; Sigma (Sigma Chemical Company, St. Louis, MO); Dynal (Dynal A.S., 
Oslo, Norway); Gull (Gull Laboratories, Salt Lake City, UT); Epicentre (Epicentre 

20 Technologies, Madison, WI); Lampire (Biological Labs., Inc., Coopersberg, PA); MJ 
Research (MJ Research, Watertown,MA); National Biosciences (National Biosciences, 
Plymouth, MN); NEB (New England Biolabs, Beverly, MA); Novagen (Novagen, Inc., 
Madison, WI); Perkin Elmer (Perkin-Elmer/ABI, Norwalk, CT); Promega (Promega, Corp., 
Madison, WI); Stratagene (Stratagene Cloning Systems, La Jolla, CA); Clonetech 

25 (Clonetech, Palo Alto, CA) Pharmacia (Pharmacia, Piscataway, NJ); Milton Roy (Milton 
Roy, Rochester, NY); Amersham (Amersham International, Chicago, EL); and USB (U.S. 
Biochemical, Cleveland, OH). Glen Research (Glen Research, Sterling, VA); Coriell 
(Coriell Cell Repositories, Camden, NJ); Gentra (Gentra, Minneapolis, MN); Third Wave 
Technologies (Third Wave Technologies, Madison, WI); PerSeptive Biosystems 

30 (PerSeptive Biosystems, Framington, MA); Microsoft (Microsoft, Redmond, WA); Qiagen ' 
(Qiagen, Valencia, CA); Molecular Probes (Molecular Probes, Eugene, OR); VWR (VWR 
Scientific, ); Advanced Biotechnologies (Advanced Biotechnologies, INC., Columbia, MD). 
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EXAMPLE 1 

Measurement of gene dosage using the INVADER assay 
A. DNA Samples 

5 Aneuploidy cell lines were obtained from the Coriell Institute for Medical Research. 

DNA was isolated using the Gentra Systems, Inc. (Minneapolis, MN) PUREGENE DNA 
Purification Kit (for manual purification) or AUTOPURE LS (for automated purification). 
The following cell lines were obtained: 



Description 


Karyotype 


Repository # 


Cell Type 


Trisomy 18 


48, XXX, +18 


GM03623 


Fibroblast 


Trisomy 18 


47, XX, +18 


GM02422 


Amniotic fluid 


Trisomy 1 8 


47, XX, +18 


AG12614 


Fibroblast 


Trisomy 13 


47, XX, +13 


AG 12070 


Aminotic fluid 


Trisomy 13 


47,XY,+13 


GM03330 


Fibroblast 


Trisomy 13 


47, XY.+13 


GM02948A 


Fibroblast 


Trisomy 21 


47, XY, +21 


AG09394 


Lymphoblast 


Trisomy 21 


47, XX, +21 


AG 13429 


Lymphoblast 


Trisomy 21 


47, XX, +21 


AG10098 


Lymphoblast 


Monosomy 21 


peripheral blood 
lymphocyte sample: 
46, XX, -21, 
+t(21;21), 
lymphoblast culture 
was 45, XX, -21 


NA0 1201 (DNA) 


Lymphoblast 
(provided as DNA) 


XYY Syndrome 


47, XYY 


GM01250A 


Fibroblast 


XYY Syndrome 


47, XYY 


GM09326 


Fibroblast 


XXXXX Syndrome 


48, XXXX; 49, 
XXXXX 


GM05009C 


Fibroblast 


Iso X Syndrome 


45, X/46, X, i (X) 
(qter>cen>qter) 


GM03543 


Lymphocyte 


XO Syndrome/ 
Turner Syndrome 


45, X/46, X, i (X) 
(qter>cen>qter) 


GM13166 | 


Lymphocyte 
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Turner Syndrome 


45, X 


AG08006 


Lymphocyte 


Aneuploid 


48.XXXX 


GM01416E 


Lymphocyte 



Table 1 : Cell lines and their genotypes. 



Control disomic samples were prepared using either the Gentra Systems, Inc. 
(Minneapolis, MN) PUREGENE DNA Purification Kit (for manual purification) or 
AUTOPURE LS (for automated purification) to purify genomic DNA from whole blood or 
tissue culture cells. Both kits were used according to the manufacturer's protocols. DNA 
was prepared from cultured amniocytes using a homebrew phenol chloroform DNA 
extraction method. Unless stated otherwise, control and test samples to be compared in any 
given experiment were purified by the same method. DNA was quantified following 
purification, e.g. by PICOGREEN (Molecular Probes, Eugene, OR) assay or A 26 o, before 
analysis and comparable amounts of the appropriate controls and test samples, ranging from 
5-160 ng of DNA, were added to the respective assays. 

B. INVADER oligonucleotide designs 

The EXON TAGGER method (see Description) was used to identify candidate 
regions as suitable targets for the INVADER assay. The INVADER CREATOR program 
was then used to design INVADER and probe oligonucleotide sequences. Oligonucleotides 
comprising appropriate designs were synthesized using conventional procedures. Example 
2 contains a description of how the use of this program was modified to result in the 
selection of candidate target regions with various degrees of chromosomal specificity. 

C. INVADER assay reagents and methods 

Figure 3 lists the genes on each chromosome targeted for analysis and the 
oligonucleotide sequences of the INVADER and probe oligonucleotides used to detect the 
various genes. For each of these sequences, the 5' portion ("flap") is highlighted with 
underlining. The remaining non-underlined part of the sequences is the 3' portion (Target 
Specific Region). Also, fragments that would be generated during an invasive cleavage 
reaction with these sequences (and the indicated INVADER oligonucleotides) are the 
underlined sequence (5' portion) plus the first base from the 3' portion. These fragments are 
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designed to participate in a second invasive cleavage reaction with a FRET cassette by 
serving as the INVADER (upstream) oligonucleotide in this second invasive cleavage 
reaction. All of these probe oligonucleotides contain hexanediol as a 3' blocking group. 
INVADER assays were set up to determine chromosome copy number as follows. 
5 Target DNA was provided as genomic DNA prepared as described above. Biplex 
INVADER reactions (e.g. as shown in Figure 1), in which a chromosome-specific 
oligonucleotide set was biplexed with oligonucleotides directed to an internal control (e.g. a 
portion of the ct-actin gene (ACTA1), with oligonucleotide sequences of SEQ ID NOs:l and 
100; ACTA1; Figure 3), were carried out in a final volume of 20 p\ in a 96-well microplate. 
10 The chromsomes and genes targeted, their chromsomal location ("cytoband"), and the 
Genbank accession number consulted for assay design are listed below. 



Lnromo 
some 


****** 

Oene 


Genbank Accession # 


Cytoband 


01 
Z I 


CTpU 


INJVi UUO7H0 






ju'ov-'.rvAJ 








AMLl 
exon 1 


NM_001754 


21q22.12 




AMLl 
exon 4 


NM_001754 


21q22.12 


X 


AR 


NM_000044 


Xql2 




LI CAM 


NM_000425 


Xq28 




PDCD8 


NM_004208 


Xq26.1 




PPEF1 


NM_006240 


Xp22.13 


Y 


SRY 


NM 003140 


Ypll.31 




ElflAY 


NM_004681 


Yq 11.222 


18 


GATA6 


NM_005257 


18ql 1.2 




SERPIN 
B 


NM_002575 


18q22.1 


13 


CCNA 


NM_003914 


13ql3.3 




ING1 


NM_005537 


13q34 




DLEU1 


NM_005887 


13ql4.2 
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Table 2: Chromosomes and genes targeted in the INVADER assay. 

Aliquots of 10 /xl of each sample (genomic DNA, final amounts ranging between 5 
ng to 160ng per reaction) or no target control (100 ng//xl tRNA) were added to the 
5 appropriate wells and then overlaid with 25 /xl mineral oil. Samples were denatured at 95°C 
for 5 minutes and then cooled to 75°C. A 10 fii aliquot of the following INVADER reaction 
mix was then added to each well and mixed by pipetting: 



Component 


Amount per 
reaction 


Final 

concentrations 


DNA reaction buffer 1 (14%PEG, 40 mM 
MOPS, pH 7.5, 56 mM MgCl 2 , 0.02% ProClin 
300) 


5 \i\ 


3.5% PEG, 10 
mM MOPS, 14 
mM MgCl 2 


Chromosome Specific Primary 
probe/INVADERoligo(10uM/l piM) 


1 /xl 


10 pmol probe, 
1 pmol invader 


Internal Control Primary Probe/Invader(10 
/xM/l^iM) 


1 /xl 


10 pmol probe, 1 
pmole invader 


FAM FRET (10/iM) SEQ ID NO: 199 [Fam- . 
TCT-Z28- 

AGCCGGTTTTCCGGCTGAGACCTCGGCG 
CG-hex] 


0.5 /xl 


5 pmol 


RED FRET (10/iM) SEQ ID NO: 200 [Red- 
TCT-Z28- 

TCGGCCTTTTGGCCGAGAGACTCCGCGT 
CCGT-hex] 


0.5 /il 


5 pmol 


CLEAVASE X enzyme (40ng//il) in 
CLEAVASE dilution buffer 


1 /xl 


40 ng 


RNAse free water 


lul 





Table 3: INVADER assay reaction components 

10 

Reactions were incubated at 63°C for 4 hours and then cooled to 4°C prior to 
scanning in a CYTOFLUOR 4000 fluorescence plate reader (Applied Biosystems, Foster 
City, CA). The settings used were: 485/20 nm excitatioivT)andwidth and 530/25 nm 
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emission/bandwidth for FAM dye detection, and 560/20 nm excitation/bandwidth and 
620/40 nm emission/bandwidth for RED dye detection. The instrument gain was set for 
each dye so that the No Target Blank produced between 100 - 250 Relative Fluorescence 
Units (RFUs). NOTE: Because the optimal gain setting can vary between instruments, 
5 adjust the gain as needed to give the best signal/background ratio (sample raw signal 
divided by the No Target Control signal) or No Target Control sample readings of - 100 
RFUs. Fluorescence microplate readers that use a xenon lamp source generally produce 
higher RFUs. For directly reading the microplates, the probe height of, and how the plate is 
positioned in, the fluorescence microplate reader is adjusted according to the manufacturer's 
1 0 recommendations. 

The raw data that is generated by the device/instrument is used to measure the assay 
performance (real-time or endpoint mode). The equations below provide how FOZ (Fold 
Over Zero), and other values are calculated. NTC in the equations below represents the 
signal from the No Target Control. 

15 

FOZ or Signal/No Target 

FOZoyei =(RawSignal D yei/NTC D yei) 
FOZ Dye2 = (RawSignal D ye2/NTC D ye2) 

20 In the following examples, FOZoyd corresponds to the signal from the chromosome-specific 

assay, and FOZoye2, to that from the internal control assay. The two FOZ values (i.e. 

chromosome-specific and internal control) for each sample were used to calculate the 

chromosome-specific: internal control Ratio as follows: 

Ratio = fNet chromosome specific FOZ) 
25 (Net internal control FOZ) 

where Net FOZ = FOZ - 1 

To determine the ratio normalized to two copies per genome, the following calculation is 
performed with the Net FOZ ratios. 

30 

Normalized Ratio= (Ratio/Average 2-copy ratio) X2 
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C. Limit of Detection (LOD) of INVADER assays 

Experiments were conducted to examine the effect of varying DNA concentration on 
assay performance and the ability to discriminate genotypes. In particular, the effects of 
limiting (i.e., 5 ng) genomic DNA were addressed. 
5 INVADER assays were set up as described in Example IB, with the final amounts 

of genomic DNA ranging from 0-160 ng. DNA was quantified using the PICOGREEN test 
(Molecular Probes, Eugene OR). Assays were designed to genes on chromosome 21 : DSCR 
6 (SEQ ID NOs: 17 and 1 16); STCH (SEQ ID N02: 217 and 218) and AML exons 1 (SEQ 
ID NOs: 20 and 119) and 4 (SEQ ID NOs: 22 and 121). The results are presented in Figure 
10 4 and indicate that the assays were informative across the range of DNA concentrations 

tested at or above 10 ng. Samples containing just 5 ng of genomic DNA failed to yield FOZ 
values above 1.15, which were therefore excluded from consideration as being below the 
limit of robust detection. DNA concentrations greater than 1 60 ng were not tested in this 
experiment. 

15 

D. Results of INVADER assays to detect variations in gene dosage 

The results of these experiments from reactions directed to chromosomes 13, 18, 21, 
X and Y are presented in Figures 5-9, respectively. The samples are aligned along the X- 
axis and grouped according to sample type (i.e. disomy or trisomy). The normalized ratio is 
20 indicated along the Y-axis as denoted in the legend. 

i. Analysis of chromosome 13 copy number 

Figure 5 presents the results of the analysis of INVADER assays to determine copy number 
of loci carried on chromosome 13: deleted in lymphocytic leukemia, 1, 
(DLEU; SEQ ID NOs: 37 and 1 36); cyclin Al (CCNA; SEQ ID NOs: 223 and 224); and 
25 inhibitor of growth family, member 1 (ING1 ; SEQ ID NOs: 227 and 228). The samples 

tested are listed along the X-axis and labeled. Samples G2, 5, 15, 16, and 25 were obtained 
from normal individuals presumed to be disomic for chromosome 13; samples GM03330 
and GM02948A were obtained from Coriell and are trisomic for chromosome 13 (see Table 
1). 

30 These results demonstrate that the INVADER assay can readily discriminate disomy 

(normal, 2 copy) and trisomy (abnormal, 3 copy) cases at these chromosome 13 loci. 

ii. Analysis of chromosome 1 8 copy number 
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Figure 6 presents the results of the analysis of INVADER assays to determine copy 
number of loci carried on chromosome 18: GATA-binding protein 6; (GATA6, SEQ ID 
NOs: 35 and 134) and serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), 
member 2 (SERPINB2; SEQ ID NOs: 36 and 135). The samples tested are listed along the 
5 X-axis and labeled. Samples G2, 5, 15, 16, and 25 were obtained from normal individuals 
presumed to be disomic for chromosome 1 8; sample GM03623 was obtained from Coriell 
and is trisomic for chromosome 18 (see Table 1); M and A refer to "manual" and 
"autopure" and describe the procedures used to isolate the DNA from the Coriell cell line. 

These results demonstrate that the INVADER assay can readily discriminate disomy 
10 (normal, 2 copy) and trisomy (abnormal, 3 copy) cases at these chromosome 1 8 loci. 

iii. Analysis of chromosome 21 copy number 

Figures 7 A-D present the results of the analysis of INVADER assays carried out on 
DNA extracted from blood to determine copy number of loci carried on chromosome 2 1 : 
DSCR6 (Downs' syndrome critical region 6; SEQ ID NOs: 17 and 1 16), AML exon 1 (runt- 

15 related transcription factor 1 (acute myeloid leukemia 1, amll oncogene)), SEQ ID NOs: 20 
and 119), STCH (stress 70 protein chaperone; SEQ ID NOs: 217 and 218); and AML exon 
4 ((runt-related transcription factor 1 (acute myeloid leukemia 1, amll oncogene)), SEQ ID 
NOs: 22 and 121; see Table 2 for chromosome locations). The samples tested are listed 
along the X-axis and labeled. Samples 2-9 were obtained from normal individuals 

20 presumed to be disomic for chromosome 21, Sample 1 was obtained from Coriell 

(NA01201) and is monosomic for chromosome 21 and Samples 10 and 1 1 were obtained 
from Coriell (AG 13429 and AG09394 respectively) and are trisomic for chromosome 21 . 

Figures 7 E-F present the results of the analysis of INVADER assays carried out on 
blinded genomic DNA samples isolated from cultured amniocytes to determine copy 

25 number of loci carried on chromosome 2 1 : DSCR6 (Downs' syndrome critical region 6; 
SEQ ID NOs: 17 and 1 16); STCH (stress 70 protein chaperone; SEQ ID NOs: 217 and 
218). Both of these INVADER assays reported Samples 1-5 to be disomic for chromosome 
21 and Sample 6 to be trisomic for chromosome 2 1 . INVADER assay results corresponded 
with karyotype results. 

30 These results demonstrate that the INVADER assay can readily discriminate 

disomic (normal, 2 copy) and monosomic or trisomic mutant karyotypes at these 
chromosome 21 loci. 

iv. Analysis of X and Y chromosome copy number 
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Figure 8 presents the results of the analysis of INVADER assays to determine copy 
number of two different loci carried on different arms of the X chromosome: PDCD8 
(programmed cell death; Xq26.1; SEQ ID NOs: 31 and 130), and PPEF1 (protein 
phosphatase, EF-hand calcium-binding domain 1; Xp22.13; SEQ ID NOs: 32 and 131). The 
5 samples tested are listed along the X-axis and labeled. In addition to testing 10 genomic 
DNA samples isolated from the blood of normal individuals (5 each from XX females 
[samples 1-5] and XY males [samples 6-10]), genomic DNA from individuals with various 
X/Y anomalies was tested (see Table 1). 

These results demonstrate that the INVADER assay can readily discriminate 

10 normal XX individuals from normal XY individuals. In addition, the following anomalous 
genotypes were analyzed: 45 X (sample 11; Coriell No. AG08006); 2 45X/46X (iso)X 
samples, both qter >cen> qter, (i.e. the q arms are duplicated, and the p arm is absent) 
(sample 12, GM13166: 30% 45, X / 70% 46, X(iso)X and sample 13, GM03543: 40% 45, 
X / 60% 46, X(iso)X; 48 XXX, +18 (sample 14; GM03263), 48XXXX (sample 15; 

15 GM01416E), 48 XXXX/49XXXXX (sample 16; GM05009C) and two 47 XYY samples 
(samples 17-18; GM01250A and GM09324). Samples 12 and 13 appear to contain more 
copies of the q arm of the X chromosome (Figures 8A) than of the p arm (Figure 8b), 
consistent with the presence of a q duplicated isochromosome in those samples. The 
remaining samples contain additional copies of the X chromosome 

20 Figure 9 shows the results of samples analyzed at two different loci on the Y 

chromosome: SRY (sex determining region Y, SEQ ID NOs: 33 and 132) and EIF1AY 
(eukaryotic translation minitiation factor 1A, Y chromosome, SEQ ID NOs: 34 and 133). 
Samples 1-5 are normal XX samples; and 6-10, normal XY samples. Samples 11-12 contain 
XYY samples, as in samples 17 and 18 of Figure 8. 

25 The results in Figure 9 indicate that, as expected, the normal XX samples contain no 

copies of the Y chromosome. However, the EDF1AY probe set (SEQ ID NOs: 34 and 133) 
appears to be somewhat less specific for the Y chromosome than does the SRY probe set, 
suggesting that it may be desirable to examine alternative Y chromosome sequences to 
ensure specificity. Moreover, the XYY samples appear to contain more than just the two 

30 copies of Y anticipated, further suggesting that homologous sequences may be being 

detected by the probe sets or that there was a potential problem with the preparation of this 
genomic DNA sample. It is noteworthy, however, that the combination of results presented 

40 



WO 2005/007869 



PCT/US2004/022014 



in Figures 8 and 9 lead to a consistent determination of the number of X chromosomes in 
these two samples and to the definitive presence of aneuploidy vis a vis the Y chromosome. 

EXAMPLE 2 

5 Use Of The EXON TAGGER Program To Identify Target Regions 

A. Target Sequences in the DSCR Gene on Chromosome 21 

The EXON TAGGER approach was used to identify appropriate, unique 50-mer 
sequences in the DSCR 6 (Downs Critical Region) gene, on chromosome 21. An initial 

10 analysis was done using the June, 2002 human genome assembly. INVADER assays were 
designed to this sequence with the probe and INVADER oligonucleotides comprising SEQ 
ID NOs: 15 and 114, respectively. Assays were carried out as described in Example 1 . 

While these oligonucleotide sets appeared to detect the targeted sequence in the 
DSCR 6 gene, when a sample known to be trisomic for chromosome 21 was tested, the 

15 INVADER assay failed to detect a significant increase in signal, as would be expected in 
the presence of an additional copy of the gene (Figure 10). It was discovered that the 
DSCR6 50mer sequence was part of a SINE (Alu) repeat and was highly homologous to . 
regions on several different chromosomes. The megaBLAST criteria for removing a 
potential 50mer were set so that only sequences that had 100% homology to other regions 

20 were removed, and allowed 50mers with </=99% homology to still remain. The presence of 
this additional homology had the effect of elevating the total signal such that small 
differences (e.g. of 50%) were not readily distinguished. 

The EXON TAGGER analysis was repeated, using a subsequent genome assembly, 
the November 2002 human genome assembly, and another probe set was designed to a 

25 different 50mer candidate sequence but still within the DSCR6 gene. Assays were carried 
out as described above and in Example 1. 

The results comparing the performance of the initial and redesigned oligonucleotide 
sets for detecting a region in the DSCR 6 gene are presented in Figure 10. These results 
indicate that the INVADER assay designed to target a unique sequence in the DSCR 6 gene 

30 can readily discriminate small differences such that samples containing 2 or 3 copies of the 
DSCR 6 can be distinguished from one another. 
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B. INVADER Assay Design Screen 

Success of INVADER assay designs was based on a clear distinction of disomy and 
trisomy (or other anomalous karyotypes, as for the X and Y chromosomes) samples, 
background counts of less than 250, and signal over background values of at least 1.5 using 
5 100 ng of gDNA. Four out of 14 designs (29%) were considered successful, all to 

chromosome 21 and designed using the initial EXON TAGGER criteria were suitable for 
distinguishing disomic from trisomic samples. The EXON TAGGER parameters were then 
changed to include a repetitive element screen and the megaBLAST criteria were set to 
accept only sequences with <=80% homology to another region in the human genome. 
1 0 These criteria were implemented for the EXON TAGGER data generated for the 

chromosomes X,Y,13 and 18. Applying the revised criteria, 14 out of 15 subsequent designs 
(93%) were deemed to be successful. 

EXAMPLE 3 

1 5 Measurement Of Gene Dosage In The Presence Of Mock Maternal 

Genomic DNA Contamination 

In certain applications, it may be desirable to measure gene dosage in certain 
chromosomal samples in the presence of contaminating chromosomal samples from a 
different source or of a different type, for example fetal or embryonic samples in the 

20 presence of contaminating maternal material. To this end, the impact of contaminating 
disomic DNA, e.g. from a normal individual, on the measurement of chromosome copy 
number of disomic or trisomic samples was assessed. 

The results are presented in Figure 1 1 A-D. Four different loci on chromosome 21 
were used as targets to test the effects of combining disomic genomic DNA with trisomic 

25 (for chromosome 21) genomic DNA. Reactions were set up and carried out as in Example 
1 . In each case, the percentage of disomic and trisomic genomic DNA was varied from 0- 
100%, with the total DNA added to the reaction limited to 50 ng or 100 ng. Normalized 
ratios elevated significantly above 2 indicate the presence of aneuploid samples, i.e. samples 
containing an additional copy of the genes examined. These results indicate that the 

30 presence of trisomic DNA can be detected in a significant background of disomic or 
"contaminating" DNA. 

Example 4 

Measurement of Gene Dosage using two Invader Assays per Targeted Chromosome 
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In certain applications, it may be desirable to measure gene dosage in a sample by 
using two Invader assays per targeted chromosome and two internal control assays, thus 
providing double coverage on each of the chromosomes tested. 

5 

A. INVADER assay reagents and methods 

Figures 12 and 3 list the genes on each chromosome targeted for analysis and the 
oligo nucleotide sequences of the INVADER and probe oligonucleotides used to detect the 
various genes. In this example, Invader probe sets were designed to 2 target regions per 

10 chromosome. The target specific probes for chromosomes 13, 18, 21, X and Y contained 
the same arm (arm 1 , CGCGCCGAGG; SEQ ID NO: 201) and utilized the corresponding 
FAM FRET cassette (SEQ ED NO. 421). The internal control probes for chromosome 1 
contained the same arm (arm 3, ACGGACGCGGAG; SEQ ID NO: 202) and utilized the 
RED FRET cassette (SEQ ID NO. 200). INVADER assays were set up to determine 

15 chromosome copy number as follows. Target DNA was provided as genomic DNA 

prepared as described above. Biplex INVADER reactions (e.g. as shown in Figure 1), in 
which the chromosome-specific oligonucleotide sets were used with oligonucleotides 
directed to a internal controls (e.g. a portion of the a-actin (ACTA1) gene and HIST2HBE), 
were carried out in a final volume of 10 /xl in a 96-well microplate. The chromsomes and 

20 genes targeted, their chromsomal location ("cytoband"), and the Genbank accession 
number consulted for assay design are listed in Figure 12. 

Aliquots of 5 /xl of each sample (genomic DNA, final amounts ranging between 3 ng 
to 30ng per reaction) or no target control (100 ng//xl tRNA) were added to the appropriate 
wells and then overlaid with 15 /xl mineral oil. Samples were denatured at 95°C for 5 

25 minutes and then cooled to either 63°C or 75°C. A 5 /il aliquot of the following INVADER 
reaction mix was then added to each well and mixed by pipetting: 



Component 


Amount per 
reaction 


Final 

concentrations 
(in lOjil reaction) 


DNA reaction buffer 1 (14%PEG, 40 mM MOPS, pH 
7.5, 56 mM MgCl 2 , 0.02% ProClin 300) 


2.5 /xl 


3.5% PEG, 10 mM 
MOPS, 14mM 
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MgCl 2 


Chromosome Specific Primary probe/INVADER 
oligo/FAM FRET SEQ ID NO: 199[Fam-TCT-Z28- 
AGCCGGTTTTCCGGCTGAGACCTCGGCGCG-hex] 
(5 jiM/0.5 /zM/2.5jiM) 




0.5 \iM probe, 
O.OSjiM Invader oligo, 
0.25nM FRET 


Internal Control Primary probe/INVADER oligo/RED 

FRET SEQ ID NO: 200[Red-TCT-Z28- 

TCGGCCTTTTGGCCGAGAGACTCCGCGT 

CCGT-hex] 

(5 uM/0.5 nW2.5yM) 


1 fll 


0.5nM probe, 
0.05jiM Invader oligo, 
0.25^M FRET 


CLEAVASE X enzyme (40ng/fi\) in CLEAVASE 
dilution buffer 


0.5 /xl 


2ng/^l 



Table 4: INVADER assay reaction components 



Reactions were incubated at 63°C for 4 hours and then cooled to 4°C prior to 
5 scanning in a CYTOFLUOR 4000 fluorescence plate reader (Applied Biosystems, Foster 
City, CA). The settings used were: 485/20 nm excitation/bandwidth and 530/25 nm 
emission/bandwidth for FAM dye detection, and 560/20 nm excitation/bandwidth and 
620/40 nm emission/bandwidth for RED dye detection. The instrument gain was set for 
each dye so that the No Target Blank produced between 100 - 250 Relative Fluorescence 

10 Units (RFUs). Some of the microplates were also read in the Genios FL Plate reader (Tecan, 
Research Triangle Park, NC). The settings used were: 485/535 nm excitation/emission for 
FAM dye detection, and 560/612 nm excitation/emission for RED dye detection.The 
instrument gain was set for each dye so that the No Target Blank produced between 1000 - 
2000 Relative Fluorescence Units (RFUs). NOTE: Because the optimal gain setting can 

1 5 vary between instruments, adjust the gain as needed to give the best signal/background ratio 
(sample raw signal divided by the No Target Control signal) or No Target Control sample 
readings. For directly reading the microplates, the probe height of, and how the plate is 
positioned in, the fluorescence microplate reader may need to be adjusted according to the 
manufacturer's recommendations. 

20 The raw data that is generated by the device/instrument is used to measure the assay 

performance (real-time or endpoint mode). The equations below provide how FOZ (Fold 
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Over Zero), and other values are calculated. NTC in the equations below represents the 
signal from the No Target Control. 

FOZ or Signal/No Target 

5 FOZ D yei = (RawSignal D yei/NTCDyei) 
FOZ Dye 2 = (RawSignal D y e 2/NTC D ye2) 

In the following examples, FOZ Dyc i corresponds to the signal from the chromosome-specific 

assays, and FOZDye2> to that from the internal control assays. The two FOZ values (i.e. 

chromosome-specific and internal control) for each sample were used to calculate the 

10 chromosome-specific: internal control Ratio as follows: 

Ratio = (Net chromosome specific FOZ) 
(Net internal control FOZ) 

where Net FOZ = FOZ - 1 

15 To determine the ratio normalized to two copies per genome, the following calculation is 
performed with the Net FOZ ratios. 

Normalized Ratio= (Ratio/Control 2-copy ratio) X2 

20 where the control is a normal female genomic DNA sample or pool of normal female 
gemomic DNA samples. To determine normalized ratio values to one copy per genome 
(i.e.Chromosome Y), normal male genomic DNA is used as the control and the ratio is 
multiplied by 1. 

25 B. Limit of Detection (LOD) of INVADER assays 

Experiments were conducted to examine the effect of varying DNA concentration on 
assay performance and the ability to discriminate genotypes. In particular, the effects of 
limiting (i.e., 3 ng) genomic DNA were addressed. 

INVADER assays were set up as described in Example 4A using DNA samples 
30 described in Example 1 A. The CC1/CC3 gDNA was isolated from mosaic samples obtained 
from Coriell (GM13166 and GM03543) that contained either 30% 45(X), 70% 46 (XiX) or 
40% 45 (X), 60% 46 (XiX) respectively. The final amounts of genomic DNA ranged from 
3-30 ng. DNA was quantified using the PICOGREEN test (Molecular Probes, Eugene OR). 
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Assays were designed to 4 different genes on chromosome X. Two of the genes are located 
on the Xp arm PFKFB1 (SEQ ID NOs: 53 and 152); PCTK1 (SEQ ID NOs: 87 and 186), 
and two of the genes are located on the Xq arm MTMR8 (SEQ ID NOs: 82 and 181); 
FLJ21 174 (SEQ ID NOs: 84 and 183). The internal control genes used in this example 
were ACTA1 (SEQ ID NOs: 1 and 100); and HIST2HBE (SEQ ID NOs: 10 and 109). The 
results are presented in Figure 13 and show the Invader assays can distinguish 1, 2 and 3 or 
greater copies of X across the range of DNA concentrations tested with the exception of the 
3ng genomic DNA samples. Samples that generated FOZ values less than 1.4 or fell into the 
equivocal zones were designated as No Calls. Many of the samples containing 3 ng of 
genomic DNA failed to yield FOZ values above 1.4, which were therefore excluded from 
consideration as being below the limit of robust detection. DNA concentrations greater than 
30 ng were not tested in this experiment. The recommended amount of DNA is lOng for the 
assays that contain 2 assays per targeted chromosome. 

The methods described for Figure 13 were used to evaluate the assay performance 
for the other targeted chromosomes (13, 18, 21, Y). Samples that did not call correctly 
(false negatives or false positives) were labeled as Miscalls. 

The methods described for Figure 13 were used to evaluate the assay performance 
for the other gene targets on chromosomes 13, 18, 21, and Y. The gene targets are listed in 
Figure 12. The results of the assay performance are shown in Table 5. 



Chrom 



Assays 



10ng DNA/reaction 
N (total) No Call Miscall 



5-20ng DNA/reaction 
N (total) No Call Miscall 



3-30ng DNA/reaction 
N (total) No Call Miscall 



13 
18 
21 
Xp 
Xq 
Y 
Total 



DLEU1+ PCDH9 

FU23403+CN2 

NRIP1+HLCS 

PFKB1+PCTK1 

MTMR8+FLJ21174 

SRY+PRKY 



243 
243 
243 
418 
419 
420 
1986 



6 
1 

6 
2 
1 
5 
21 



354 
354 
354 
529 
530 
531 
2652 



9 
1 
8 
3 
4 
7 

32 



462 
462 
462 
637 
638 
639 
3300 



13 
3 
12 
12 
21 
8 

69 



2 
2 
7 
0 
0 
2 
13 



Table 5. Summary of Assay performance with varying DNA concentrations 



Many of the samples containing 3 ng of genomic DNA failed to yield FOZ values 
above 1.4, which were therefore excluded from consideration as being below the limit of 
robust detection. DNA concentrations greater than 30 ng were not tested in this experiment. 
The recommended amount of DNA is lOng for the assays that contain 2 assays per targeted 
chromosome. 
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C. Assay performance with Sample Mixtures 

Experiments were conducted to examine the effect of sample mixtures on assay 
performance and the ability to measure gene dosage (i.e. fetal or embryonic samples in the 
presence of contaminating maternal material). To this end, the impact of contaminating 
5 disomic DNA, e.g. from a normal individual, on the measurement of chromosome copy 
number of disomic or trisomic samples was assessed. 

Figure 14 shows the results from the chromosome 18 Invader assay using gDNA 
samples of mixed content. The chromosome 18 assay targeted FLJ23403 (SEQ ID NOs: 47 
and 146) and CN2 (SEQ ID NOs: 74 and 173). The internal control genes used in this 

10 example were ACTA1 (SEQ ID NOs: 1 and 100); and HIST2HBE (SEQ ID NOs: 10 and 
109). Reactions were set up and carried out as described in Example 4 section A. In this 
example, trisomy 1 8 gDNA samples were mixed with 0, 10, and 20% (disomy gDNA, e.g. a 
20% disomy contaminated sample (lOOul) contained 20ul of 2ng/ul disomy gDNA and 80ul 
of 2ng/ul trisomy gDNA. 5ul of the mixed content gDNA sample (lOng total) was added to 

15 the Invader assay. Samples that generated normalized ratio values greater than 2.5 were 

called 3 copy, samples with normalized ratio values between 1.6 and 2.3 were called 2 copy. 
Samples with normalized ratio values between 2.3 and 2.5 were equivocal (no call samples). 
These results indicate that the presence of trisomic DNA can be detected in a significant 
background of disomic or "contaminating" DNA. 

20 The methods described for Figure 14 were used to evaluate the assay performance 

for the other gene targets for chromosomes 13, 21, X and Ylisted in Figure 12. Aneuploid 
samples were mixed with 10, 20 and 30% of a normal sample to mimic varying degrees of 
maternal cell contamination. Results from these experiments indicate that the Invader assays 
can detect numerical abnormalities for Chromosomes 13, 18, 21, X and Y with 99% or 

25 greater accuracy in the sample mixtures tested. 

Example 5 
Analysis of Triploidy Samples (69, XXY) 

30 The results of the SRY (Ypl 1 .31) assay (SEQ ID NOs: 33 and 132) biplexed with 

the alpha-actin internal control are presented in Figures 15. In this example, 40 genomic 
DNA samples (25ng/rxn) were tested including various samples obtained from individuals 
presumed to be normal males or females (46, XY or 46, XX), as well as various aneuploidy 
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cell line samples obtained from Coriell including (48, +18, XXX;GM03623 48, XXXX; 
GM01416E and (47,XYY GM01250A and GM09326 ) and fourdifferent triploidy cell lines 
(69, XXX; GM07744, GM10013 or 69, XXY;AG05025, AG06266). 

Normalized ratios for the SRY assay are plotted along the X axis, while the FFOZ 
5 for each assay are plotted along the Y axis. Samples that had a RFOZ of at least 1 A were 
considered valid. The Normalized Ratio* for each sample was generated by dividing the 
Ratio of each sample by the Ratio of a presumed diploidy male control (obtained from 
Novagen, cat #70572) and multiplying by 1. Samples were determined to contain 0, 1 or 2 
copies of chr. Y based on preliminary estimations for potential copy ranges. Using this 

10 method of analysis, the Normalized Ratios for the 69, XXY samples fall within the 

proposed range for 1 copy Y samples, although the 69, XXY samples do not cluster with the 
presumed normal male population (46, XY). In order to further distinguish the 69, XXY 
samples from the 46, XY population, an inverse ratio calculation can be used (divide the 
alpha actin RFOZ by the chr. Y FFOZ to generate the ratio) and calculate the normalized 

15 ratio using the inverse ratios (inverse ratio of unknown sample divided by the inverse ratio 
of the presumed normal male control sample multiplied by 2. These results demonstrate 
that using inverse ratios for the chr. Y assay to generate the normalized ratios may be used 
to determine whether or not a sample has the karyotype of 69, XXY. 

20 All publications and patents mentioned in the above specification are herein 

incorporated by reference as if expressly set forth herein. Various modifications and 
variations of the described method and system of the invention will be apparent to those 
skilled in the art without departing from the scope and spirit of the invention. Although the 
invention has been described in connection with specific preferred embodiments, it should 

25 be understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out the 
invention that are obvious to those skilled in relevant fields are intended to be within the 
scope of the following claims. 
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